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(57) Abstract: The present invention provides novel genes and proteins for diagnosing ovarian cancer and/or a likelihood for sur- 
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specifically exemplified by the genes and proteins set forth in Tables 1 to 3 and the Sequence Listing. 
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METHODS OF DIAGNOSIS AND PROGNOSIS OF OVARIAN CANCER j 
Field of the invention 

The present invention relates to the identification of nucleic acid and protein expression 
5 profiles and nucleic acids, products, and antibodies thereto that are involved in ovarian 
cancer; and to the use of such expression profiles and compositions in the diagnosis, 
prognosis and therapy of ovarian cancer. More particularly, this invention relates to 
novel genes that are expressed at elevated or reduced levels in malignant tissues and 
uses therefor in the diagnosis of cancer or malignant tumors in human subjects. This 
10 invention also relates to the use of nucleic acid or antibody probes to specifically detect 
ovarian cancer cells, such as, for example, in the ovarian surface epithelium, wherein 
over-expression or reduced expression of nucleic acids hybridizing to the probes is highly 
associated with the occurrence and/or recurrence of an ovarian tumor, and/or the 

j 

likelihood of patient survival. The diagnostic and prognostic test of the present invention 
15 is particularly useful for the early detection of ovarian cancer or metastases thereof, or 

other cancers, and for monitoring the progress of disease, such as, for example, during j 
remission or following surgery or chemotherapy. The present invention is also directed f 
to methods of therapy wherein the activity of a protein encoded by a 
diagnostic/prognostic gene described herein is modulated. 

20 I 
Background of the invention ! 

i 

i 

L General j 
As used herein the term "derived from* shall be taken to indicate that a specified integer 
are obtained from a particular source albeit not necessarily directly from that source. 

25 

Unless the context requires otherwise or specifically stated to the contrary, integers, 
steps, or elements of the invention recited herein as singular integers, steps or elements 
clearly encompass both singular and plural forms of the recited integers, steps or 



elements. 



30 



The embodiments of the invention described herein with respect to any single 
embodiment shall be taken to apply mutatis mutandis to any other embodiment of the 
invention described herein. 
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Throughout this specification, unless the context requires otherwise, the word 
"comprise", or variations such as "comprises" or "comprising", will be understood to imply 
the inclusion of a stated step or element or integer or group of steps or elements or 
integers but not the exclusion of any other step or element or integer or group of 
elements or integers. 

Those skilled in the art will appreciate that the invention described herein is susceptible 
to variations and modifications other than those specifically described. It is to be 
understood that the invention includes all such variations and modifications. The 
invention also includes all of the steps, features, compositions and compounds referred 
to or indicated in this specification, individually or collectively, and any and all 
combinations or any two or more of said steps or features. 

The present invention is not to be limited in scope by the specific examples described 
herein. Functionally equivalent products, compositions and methods are clearly within the 
scope of the invention, as described herein. 

The present invention is performed without undue experimentation using, unless 
otherwise indicated, conventional techniques of molecular biology, microbiology, virology, 
recombining DNA technology, peptide synthesis in solution, solid phase peptide 
synthesis, and immunology. Such procedures are described, for example, in the 
following texts that are incorporated herein by reference: 

1. Sambrook, Fritsch & Maniatis, Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor Laboratories, New York, Second Edition (1989), whole of Vols I, II, 
and III; 

2. DNA Cloning: A Practical Approach, Vols. I and II (D. N. Glover, ed., 1985), IRL 
Press, Oxford, whole of text; 

3. Oligonucleotide Synthesis: A Practical Approach (M. J. Gait, ed., 1984) IRL 
Press, Oxford, whole of text, and particularly the papers therein by Gait, pp1-22; 
Atkinson et at., pp35-81; Sproat etal., pp 83-115; and Wu ef a/., pp 135-151; 

4. Nucleic Acid Hybridization: A Practical Approach (B. D. Hames & S. J. Higgins, 
eds., 1985) IRL Press, Oxford, whole of text; 

5. Perbal, B., A Practical Guide to Molecular Cloning (1 984); 

6. Wunsch, E., ed. (1974) Synthese von Peptiden in Houben-Weyls Metoden der 
Organischen Chemie (Muler, E., ed.), vol. 15, 4th edn., Parts 1 and 2, Thieme, 
Stuttgart. 
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7. Handbook of Experimental Immunology, Vols. I-IV (D. M. Weir and C. C. 
Blackwell, eds., 1986, Blackwell Scientific Publications). 

This specification contains nucleotide and amino acid sequence information prepared 
5 using Patent In Version 3.1 , presented herein after the claims. Each nucleotide sequence 
is identified in the sequence listing by the numeric indicator <210> followed by the 
sequence identifier (e.g. <210>1, <210>2, <210>3, etc). The length and type of 
sequence (DNA, protein (PRT), etc), and source organism for each nucleotide sequence, 
are indicated by information provided in the numeric indicator fields <211>, <212> and 
10 <213> f respectively. Nucleotide sequences referred to in the specification are defined by 
the term "SEQ ID NO:", followed by the sequence identifier (eg. SEQ ID NO: 1 refers to 
the sequence in the sequence listing designated as <400>1). 

The designation of nucleotide residues referred to herein are those recommended by the 
15 IUPAC-IUB Biochemical Nomenclature Commission, wherein A represents Adenine, C 
represents Cytosine, G represents Guanine, T represents thymine, Y represents a 
pyrimidine residue, R represents a purine residue, M represents Adenine or Cytosine, K 
represents Guanine or Thymine, S represents Guanine or Cytosine, W represents 
Adenine or Thymine, H represents a nucleotide other than Guanine, B represents a 
20 nucleotide other than Adenine, V represents a nucleotide other than Thymine, D 
represents a nucleotide other than Cytosine and N represents any nucleotide residue. 

2. Description of the related art 

Cancer is a multi-factorial disease and major cause of morbidity in humans and other 
25 animals, and deaths resulting from cancer in humans are increasing and expected to 
surpass deaths from heart disease in future. Carcinomas of the lung, prostate, breast, 
colon, pancreas, and ovary are major contributing factors to total cancer death in 
humans. For example, prostate cancer is the fourth most prevalent cancer and the 
second leading cause of cancer death in males. Similarly, cancer of the ovary is the 
30 second most common cancer of the female reproductive organs and the fourth most 
common cause of cancer death among females. With few exceptions, metastatic 
disease from carcinoma is fatal. Even if patients survive their primary cancers, 
recurrence or metastases are common. 

35 It is widely recognized that simple and rapid tests for solid cancers or tumors have 
considerable clinical potential. Not only can such tests be used for the early diagnosis of 
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cancer but they also allow the detection of tumor recurrence following surgery and 
chemotherapy. A number of cancer-specific blood tests have been developed which 
depend upon the detection of tumor-specific antigens in the circulation (Catalona, W.J., 
et a/., 1991, "Measurement of prostate-specific antigen in serum as a screening test for 
5 prostate cancer", N. Engl. J. Med. 324, 1156-1161; Barrenetxea, G., et a/., 1998, "Use of 
serum tumor markers for the diagnosis and follow-up of breast cancer", Oncology, 55, 
447-449; Cairns, P., and Sidreansky, D., 1999, "Molecular methods for the diagnosis of 
cancer". Biochim. Biophys. Acta. 1423, C 11-C 18). 

10 Ovarian cancer is the fourth most frequent cause of cancer death in females and in the 
United States, and accounts for approximately 13,000 deaths annually. Furthermore, 
ovarian cancer remains the number one killer of women with gynaecological malignant 
hyperplasia and the incidence is rising in industrialized countries. The etiology of the 
neoplastic transformation remains unknown although there is epidemiological evidence 

15 for an association with disordered endocrine function. The incidence of ovarian 
carcinoma is higher in nulliparous females and in those with early menopause. 

Most ovarian cancers are thought to arise from the ovarian surface of epithelium (OSE). 
Epithelial ovarian cancer is seldom encountered in women less than 35 years of age. Its 

20 incidence increases sharply with advancing age and peaks at ages 75 to 80, with the 
median age being 60 years. The single most important known risk factor is a strong 
familial history of breast or ovarian cancer. To date, little is known about the structure 
and function of the OSE cells. It is known that the OSE is highly dynamic tissue that 
undergoes morphogenic changes, and has proliferative properties sufficient to cover the 

25 ovulatory site following ovulation. Morphological and histochemical studies suggest that 
the OSE has secretory, endocytotic and transport functions which are hormonally- 
controlled (Blausteln and Lee, Oncol. 8, 34-43, 1979; Nicosia and Johnson, Int. J. 
Gynecol. Pathol., 3, 249-260, 1983; Papadaki and Beilby, J. Cell Sci. 8, 445-464, 1971; 
Anderson etal. t J. MorphoL, 150, 135-164, 1976). 

30 

Ovarian cancers are not readily detectable by diagnostic techniques (Siemens et al. f J. 
Cell. Physiol., 134: 347-356, 1988). In fact, the diagnosis of carcinoma of the ovary is 
generally only possible when the disease has progressed to a late stage of development. 
Approximately 75% of women diagnosed with ovarian cancer are already at an advanced 
35 stage (III and IV) of the disease at their initial diagnosis. During the past 20 years, neither 
diagnosis nor five year survival rates have greatly improved for these patients. This is 
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substantially due to the high percentage of high-stage initial detection of the disease. 
There is therefore a need to develop new markers that improve early diagnosis and 
thereby reduce the percentage of high-stage initial diagnoses. 

5 A number of proteinaceous ovarian tumor markers were evaluated several years ago, 
however these were found to be non-specific, and determined to be of low value as 
markers for primary ovarian cancer (Kudlacek et a/., Gyn. One. 35, 323-329, 1989; 
Rustin et a/., J. Clin. One, 7, 1667-1671, 1989; Sevelda et a/., Am. J. Obstet Gynecol., 
161, 1213-1216, 1989; Omar et a/., Tumor Biol., 10, 316-323, 1989). Several 

10 monoclonal antibodies were also shown to react with ovarian tumor associated antigens, 
however they were not specific for ovarian cancer and merely recognize determinants 
associated with high molecular weight mucin-like glycoproteins (Kenemans etal. t Eur. J. 
Obstet. Gynecol. Repod. BioL 29, 207-218, 1989; McDuffy, Ann. Clin. Biochem., 26, 379- 
387, 1989). More recently, oncogenes associated with ovarian cancers have been 

15 identified, including HER-2/neu (c-erbB-2) which is over-expressed in one-third of ovarian 
cancers (USSN 6,075,122 by Cheever et a/, issued June 13, 2000), the fms oncogene, 
- and abnormalities in the p53 gene, which are seen in about half of ovarian cancers. 

Whilst previously identified markers for carcinomas of the ovary have facilitated efforts to 
20 diagnose and treat these serious. diseases, there is a clear need for the identification of 
additional markers and therapeutic targets. The identification of tumor markers that are 
amenable to the early-stage detection of localized tumors is critical for more effective 
management of carcinomas of the ovary. 

25 Summary of the invention 

In work leading up to the present invention, the inventors sought to identify nucleic acid 
markers that were diagnostic of ovarian cancers generally, or diagnostic of specifc 
ovarian cancers such as, for example, serous ovarian cancer (SOC), mucinous ovarian 
cancer (MOC), non-invasive (borderline ovarian cancer or low malignant potential 

30 ovarian cancer), mixed phenotype ovarian cancer, endometrioid ovarian cancer (EnOC) 
and clear cell ovarian cancer (CICA), papillary serous ovarian cancer, Brenner cell or 
undifferentiated adenocarcinoma, by virtue of their modulated expression in cancer 
tissues derived from a patient cohort compared to their expression in healthy or non- 
cancerous cells and tissues. Additionally, the inventors sought to determine whether any 

35 correlation exists between the expression of any particular gene in a subject having 
ovarian cancer and the survival, or likelihood for survivial, of the subject during the 
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medium to long term (i.e. in the period between about 1-2 years from primary diagnosis, 
or longer). The inventors also sought to to determine whether any correlation exists 
between the expression of any particular gene in a subject following treatment for ovarian 
cancer and the recurrence, or likelihood for recurrence, of ovarian cancer in the subject 
5 during the medium to long term (i.e. in the period between about 1-2 years from primary 
diagnosis, or longer). 



As exemplified herein, the inventors identified a number of genes whose expression is 
altered (up-regulated or down-regulated) in individuals with ovarian cancer compared to 
10 healthy individuals., eg., subjects who do not have ovarian cancer. The particular genes 
are identified in Tables 1 and 2. Preferably, the genes are selected from the group of 
candidate genes set forth in Table 3. 

The list of genes and proteins exemplified herein by Table 1 were identified by a 
15 statistical analysis as outlined in the examples which gave a P-value, eg., by comparison 
of expression to the expression of that gene in normal ovaries. 

Accordingly, one aspect of the present invention provides a method of detecting an 
ovarian cancer-associated transcript in a biological sample, the method comprising 
20 contacting the biological sample with a polynucleotide that selectively hybridizes to a . 
sequence at least 80% identical to a sequence as shown in Table 1 or 2 or 3. Preferably 
the percentage identity to a sequence disclosed in any one of Tables 1-3 is at least 
about 85% or 90% or 95%, and still more preferably at least about 98% or 99%. 

25 In a preferred embodiment, the present invention provides a method of diagnosing an 
ovarian cancer in a human or animal subject being tested said method comprising 
contacting a biological sample from said subject being tested with a nucleic acid probe 
for a time and under conditions sufficient for hybridization to occur and then detecting the 
hybridization wherein a modified level of hybridization of the probe for the subject being 

30 tested compared to the hybridization obtained for a control subject not having ovarian 
cancer indicates that the subject being tested has an ovarian cancer, and wherein said 
nucleic acid probe comprises a sequence selected from the group consisting of. 
(i) a sequence comprising at least about 20 contiguous nucleotides from a sequence 
selected from the group consisting of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 

35 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 46, 48, 50, 52, 53, 55, 57, 

59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81 and 83; 
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(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a sequence selected from the 
group consisting of SEQ ID NQs: 1, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, 
29, 31, 33, 35, 37, 39. 41, 43, 45. 46, 48, 50, 52, 53, 55, 57, 59, 61. 63, 65, 67, 

' 5 69, 71 , 73, 75. 77, 79. 81 and 83; 

(iii) a sequence that is at least about 80% identical to a sequence selected from the 
group consisting of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23. 25, 27, 
29, 31, 33, 35, 37, 39, 41, 43, 45, 46, 48, 50, 52, 53, 55, 57, 59, 61, 63, 65, 67, 
69, 71, 73, 75, 77, 79, 81 and 83; 

10 (iv) a sequence that encodes an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28. 30, 32, 
34, 36. 38, 40, 42, 44, 47, 49, 51, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76. 
78, 80, 82 and 84; and 
(v) a sequence that is complementary to any one of the sequences set forth in (i) or 

15 (ii) or (iii) or (iv). 

In a preferred embodiment, the present Invention provides a method of diagnosing an 
ovarian cancer in a human or animal subject being tested said method comprising 
contacting a biological sample from said subject being tested with a nucleic acid probe 
20 for a time and under conditions sufficient for hybridization to occur and then detecting the 
hybridization wherein a modified level of hybridization of the probe for the subject being 
tested compared to the hybridization obtained for a control subject not having ovarian 
cancer indicates that the subject being tested has an ovarian cancer, and wherein said 
nucleic acid probe comprises a sequence selected from the group consisting of. 
25 (i) a sequence comprising at least about 20 contiguous nucleotides from a sequence 
selected from the group consisting of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 
19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 46, 48, 50, 52, 53. 55, 57, 
59, 61 , 63, 65, 67, 69, 71 , 73, 75, 77, 79, 81 and 83; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
30 to at least about 20 contiguous nucleotides from a sequence selected from the 

group consisting of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 
29, 31, 33, 35, 37, 39, 41, 43, 45, 46, 48, 50, 52, 53, 55, 57, 59, 61, 63, 65, 67, 
69, 71 , 73, 75, 77, 79, 81 and 83; 

(iii) a sequence that is at least about 80% identical to a sequence selected from the 
35 group consisting of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21. 23. 25, 27, 
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29, 31, 33, 35, 37, 39, 41, 43, 45, 46, 48, 50, 52, 53, 55, 57, 59, 61, 63, 65, 67, 
69, 71,73, 75, 77, 79, 81 and 83; 

(iv) a sequence that encodes an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 

5 , 34, 36, 38, 40, 42, 44, 47, 49, 51, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 

78, 80, 82 and 84; and 

(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (Hi) or (iv). 

10 Even more preferably, the present invention provides a method of diagnosing an ovarian 
cancer in a human or animal subject being tested said method comprising contacting a 
biological sample from said subject being tested with a nucleic acid probe for a time and 
under conditions sufficient for hybridization to occur and then detecting the hybridization 
wherein a modified level of hybridization of the probe for the subject being tested 

15 compared to the hybridization obtained for a control subject not having ovarian cancer 
indicates that the subject being tested has an ovarian cancer, and wherein said nucleic 
acid probe comprises a sequence selected from the group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a sequence 
selected from the group consisting of SEQ ID NOs: 1, 5, 7, 9, 11, 13, 15, 17, 21, 

20 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 45, 46, 48, 52, 53, 55, 57, 59, 61, 63, 65, 

67, 69, 71, 73, 75, 77, 79, 81 and 83; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a sequence selected from the 
group consisting of SEQ ID NOs: 1, 5, 7, 9, 11, 13, 15, 17, 21, 23, 25, 27, 29, 31, 

25 33, 35, 37, 39, 41, 45, 46, 48, 52, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 

77, 79, 81 and 83; 

(iii) a sequence that is at least about 80% identical to a sequence selected from the 
group consisting of SEQ ID NOs: 1, 5, 7, 9, 11, 13, 15, 17, 21, 23, 25, 27, 29, 31, 
33, 35, 37, 39, 41, 45, 46, 48, 52, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 

30 77, 79, 81 and 83; 

(iv) a sequence that encodes an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 2, 6, 8, 10, 12, 14, 16, 18, 22, 24, 26, 28, 30, 32, 34, 
36, 38, 40, 42, 47, 49, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82 
and 84; and 

35 (v) a sequence that is complementary to (i) or (ii) or (iii) or (iv). 
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As used herein, the term "modified level" includes an enhanced, increased or elevated 
level of an integer being assayed, or alternatively, a reduced or decreased level of an 
integer being assayed. 

In one embodiment an elevated, enhanced or increased level of expression of the nucleic 
acid is detected. In accordance with this embodiment, the present invention provides a 
method of diagnosing an ovarian cancer in a human or animal subject being tested said 
method comprising contacting a biological sample from said subject being tested with a 
nucleic acid probe for a time and under conditions sufficient for hybridization to occur and 
then detecting the hybridization wherein an enhanced level of hybridization of the probe 
for the subject being tested compared to the hybridization obtained for a control subject 
not having ovarian cancer indicates that the subject being tested has an ovarian ovarian 
cancer, and wherein said nucleic acid probe comprises a sequence selected from the 
group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a nucleic 
acid set forth in Table 1 or 2 other than a nucleic acid having an Accession 
Number selected from the group consisting of NM_022117, NMJ)05460, 
NMJ)02387, A1745249 and AI694200; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a nucleic acid set forth in Table 1 
or 2 other than a nucleic acid having an Accession Number selected from the 
group consisting of NMj)22117, NM_005460, NM_002387, AI745249 and 
AI694200; 

(iii) a sequence that is at least about 80% identical to (i) or (ii); 

(iv) a sequence that encodes a polypeptide encoded by a nucleic acid set forth in 
Table 1 or 2 other than a nucleic acid having an Accession Number selected from 
the group consisting of NM_022117, NM_005460, NM_002387, AI745249 and 
AI694200; and 

(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv). 

In a preferred embodiment, the present invention provides a method of diagnosing an 
ovarian cancer in a human or animal subject being tested said method comprising 
contacting a biological sample from said subject being tested with a nucleic acid probe 
for a time and under conditions sufficient for hybridization to occur and then detecting the 
hybridization wherein an enhanced level of hybridization of the probe for the subject 
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being tested compared to the hybridization obtained for a control subject not having 
ovarian cancer indicates that the subject being tested has an ovarian cancer, and 
wherein said nucleic acid probe comprises a sequence selected from the group 
consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a sequence 
selected from the group consisting of SEQ ID NOs: 7, 9, 11, 13, 15, 17, 21, 23, 
25, 27, 29, 31, 33, 35, 37, 39, 41, 45, 46, 48, 52, 53, 55, 57, 59, 61, 63, 65, 67, 
69, 71, 73, 75, 77, 79, 81 and 83; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
.. to at least about 20 contiguous nucleotides from a sequence selected from the 

group consisting of SEQ ID NOs: 7, 9, 11, 13, 15, 17, 21, 23, 25, 27, 29, 31, 33, 
35, 37, 39, 41, 45, 46, 48, 52, 53,.55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 
79, 81 and 83; 

(iii) a sequence that is at least about 80% identical to a sequence selected from the 
group consisting of SEQ ID NOs: 7, 9, 11, 13, 15, 17, 21, 23, 25, 27, 29, 31, 33, 
35, 37, 39, 41, 45, 46, 48, 52, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 
79, 81 and 83; 

(iv) a sequence that encodes an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 8, 10, 12, 14, 16, 18, 22, 24, 26, 28, 30, 32, 34, 36, 38, 
40, 42, 47, 49, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82 and 84; 
and 

(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(fi) or (iii) or (iv). 

In an alternative preferred embodiment, a reduced level of a diagnostic marker is 
indicative of ovarian cancer. In accordance with this embodiment, the present invention 
provides a method of diagnosing an ovarian cancer in a human or animal subject being 
tested said method comprising contacting a biological sample from said subject being 
tested with a nucleic acid probe for a time and under conditions sufficient for 
hybridization to occur and then detecting the hybridization wherein a reduced level of 
hybridization of the probe for the subject being tested compared to the hybridization 
obtained for a control subject not having ovarian cancer indicates that the subject being 
tested has an ovarian ovarian cancer, and wherein said nucleic acid probe comprises a 
sequence selected from the group consisting of: 
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(i) a sequence comprising at least about 20 contiguous nucleotides from a nucleic 
add set forth in Table 1 and having an Accession Number selected from the group 
consisting of NM_022117, NMJ)05460, NMJ)02387, AI745249 and AI694200; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
5 to at least about 20 contiguous nucleotides from a nucleic acid set forth in Table 1 

and having an Accession Number selected from the group consisting of 
NMJ322117, NM_005460, NM_002387, AI745249 and AI694200; 

(iii) a sequence that is at least about 80% identical to (i) or (ii); 

(iv) a sequence that encodes a polypeptide encoded by a nucleic acid set forth in 
10 Table 1 and having an Accession Number selected from the group consisting of 

NM j)221 1 7, NM_005460, NM J)02387, AI745249 and AI694200; and 

(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv). 

15 In a preferred embodiment, the present invention provides a method of diagnosing an 
ovarian cancer in a human or animal subject being tested said method comprising 
contacting a biological sample from said subject being tested with a nucleic acid probe 
for a time and under conditions sufficient for hybridization to occur and then detecting the 
hybridization wherein a reduced level of hybridization of the probe for the subject being 
20 tested compared to the hybridization obtained for a control subject not having ovarian 
cancer indicates that the subject being tested has an ovarian cancer, and wherein said 
nucleic acid probe comprises a sequence selected from the group consisting of: 
(i) a sequence comprising at least about 20 contiguous nucleotides from a sequence 
selected from the group consisting of SEQ ID NOs: 1, 3, and 5; 
25. (ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a sequence selected from the 
group consisting of SEQ ID NOs: 1 , 3, and 5; 
(iii) a sequence that is at least about 80% identical to a sequence selected from the 
group consisting of SEQ ID NOs: 1, 3, and 5; 
30 (iv) a sequence that encodes an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 2, 4 f and 6; and 
(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv). 



Preferably, the ovarian cancer that is diagnosed according to the present invention is an 
epithelial ovarian cancer, such as, for example, serous ovarian cancer, non-invasive 
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ovarian cancer, mixed phenotpye ovarian cancer, mucinous ovarian cancer, 
endometrioid ovarian cancer, clear cell ovarian cancer, papillary serous ovarian cancer, 
Brenner cell or undifferentiated adenocarcinoma. As will be apparent from the preferred 
embodiments described below, certain of the genes represented in Table 1, Table 2 and 
5 Table 3 are expressed at modified levels in subjects having serous or mucinous ovarian 
cancers. Data presented in Figures 1-4 also exemplifiy novel diagnostics and 
prognostics for serous ovarian cancer,, mucinous ovarian cancer, endometrioid ovarian 
cancer and clear cell ovarian cancer. 

10 As exemplified herein by Table 2, the present inventors have identified those genes 
having an elevated or reduced average ratio of expression of specific genes between 
ovarian cancer patients vs non-ovarian cancer patients, wherein a high ratio in Table 2 
indicates an enhanced expression in an ovarian cancer patients and wherein a negative 
ratio indicates that a reduced expression in an ovarian cancer patient. 

15 

In an alternative preferred embodiment, the present invention provides a method of 
diagnosing a serous ovarian cancer in a human or animal subject being tested said 
method comprising contacting a biological sample from said subject being tested with a 
nucleic acid probe for a time and under conditions sufficient for hybridization to occur and 
20 then detecting the hybridization wherein a modified level of hybridization of the probe for 
the subject being tested compared to the hybridization obtained for a control subject not 
having ovarian cancer indicates that the subject being tested has a serous ovarian 
cancer, and wherein said nucleic acid probe comprises a sequence selected from the 
group consisting of: 

25 (i) a sequence comprising at least about 20 contiguous nucleotides from a nucleic 
acid set forth in Table 2 or as set forth in Table 1 and having an Accession 
Number selected from the group consisting of: U62801, D49441, X51630, and 
AB018305; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
30 to at least about 20 contiguous nucleotides from a nucleic acid set forth in Table 2 

or as set forth in Table 1 and having an Accession Number selected from the 
group consisting of: U62801, D49441, X51630, And AB018305; 

(iii) a sequence that is at least about 80% identical to (i) or (ii); 

(iv) a sequence that encodes a polypeptide encoded by a nucleic acid set forth in 
35 Table 2 or as set forth in Table 1 and having an Accession Number selected from 

the group consisting of: U62801, D49441, X51630, And AB018305; and 
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(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv). 

In a further alternative preferred embodiment the present invention provides a method of 
5 diagnosing a mucinous ovarian cancer in a human or animal subject being tested said 
method comprising contacting a biological sample from said subject being tested with a 
nucleic acid probe for a time and under conditions sufficient for hybridization to occur and 
then detecting the hybridization wherein an elevated level of hybridization of the probe for 
the subject being tested compared to the hybridization obtained for a control subject not 
10 having ovarian cancer indicates that the subject being tested has a mucinous ovarian 
. cancer, and wherein said nucleic acid probe comprises a sequence selected from the 
group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a nucleic 
acid set forth in Table 1 and having an Accession Number selected from the group 

15 consisting of: IMM_006149, AA315933, U47732, NMJW5588, AW503395, 

NM_004063, AI073913, AI928445, NM_022454, W40460, AA132961 and 
AF111856; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a nucleic acid set forth in Table 1 

20 and having an Accession Number selected from the group consisting of: 

NMJJ06149, AA315933, U47732, NM_005588, AW503395, NM_004063, 
AI073913, AI928445, NM_022454, W40460, AA1 32961 and AF111856; 

(iii) a sequence that is at least about 80% identical to (i) or (ii); 

(iv) a sequence that encodes a polypeptide encoded by a nucleic acid set forth in 
25 Table 1 and having an Accession Number selected from the group consisting of: 

NM_006149, AA315933, U47732, NM_005588, AW503395, NMJ)04063, 
AI073913, AI928445, NM_022454, W40460, AA132961 and AF1 11856; and 

(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv). 

30 

In a preferred embodiment, the present invention provides a method of diagnosing a 
mucinous ovarian cancer in a human or animal subject being tested said method 
comprising contacting a biological sample from said subject being tested with a nucleic 
acid probe for a time and under conditions sufficient for hybridization to occur and then 
35 detecting the hybridization wherein an enhanced level of hybridization of the probe for 
the subject being tested compared to the hybridization obtained for a control subject not 
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having ovarian cancer indicates that the subject being tested has an ovarian cancer, and 
. wherein said nucleic acid probe comprises a sequence selected from the group 
consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from SEQ ID NO: 
5 57 or 59 or 61; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from SEQ ID NO: 57 or 59 or 61 ; 

(iii) a sequence that is at least about 80% identical to SEQ ID NO: 57 or 59 or 61 ; 

(iv) a sequence that encodes the amino acid sequence set forth in SEQ ID NO: 58 or 
10 60 or 62; and 

(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv). 

Those skilled in the art will be aware that as a carcinoma progresses, metastases occur 
15 in organs and tissues outside the site of the primary tumor. For example, in the case of 
ovarian cancer, metastases commonly appear in a tissue selected from the group 
consisting of omentum, abdominal fluid, lymph nodes, lung, liver, brain, and bone. 
Accordingly, the term "ovarian cancer" as used herein shall be taken to include an early 
or developed tumor of the ovary, such as, for example, any one or more of a number of 
20 cancers of epithelial origin, such as serous, mucinous, endometrioid, clear cell, papillary 
serous, Brenner cell or undifferentiated adenocarcinoma, non-invasive ovarian cancer 
such as borderline ovarian cancer or low-malignant potential ovarian cancer, or a mixed 
phenotype ovarian cancer, and optionally, any metastases outside the ovary that occurs 
in a subject having a primary tumor of the ovary. 

25 

As used herein, the term "diagnosis", and variants thereof, such as, but not limited to 
"diagnose", "diagnosed" or "diagnosing" shall not be limited to a primary diagnosis of a 
clinical state, however should be taken to include any primary diagnosis or prognosis of a 
clinical state. For example, the "diagnostic assay" formats described herein are equally 
30 relevant to assessing the remission of a patient, or monitoring disease recurrence, or 
tumor recurrence, such as following surgery or chemotherapy, or determining the 
appearance of metastases of a primary tumor. All such uses of the assays described 
herein are encompassed by the present invention. 



35 Both classical hybridization and amplification formats, and combinations thereof, are 
encompassed by the invention. In one embodiment, the hybridization comprises 
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performing a nucleic acid hybridization reaction between a labeled probe and a second 
nucleic acid in the biological sample from the subject being tested, and detecting the 
label. In another embodiment, the hybridization comprising performing a nucleic acid 
amplification reaction eg., polymerase chain reaction (PGR), wherein the probe consists 
5 of a nucleic acid primer and nucleic acid copies of the nucleic acid in the biological 
sample are amplified. As will be known to the skilled artisan, amplification may proceed 
classical nucleic acid hybridization detection systems, to enhance specificity of detection, 
particularly in the case of less abundant rnRNA species in the sample. 

10 In a preferred embodiment, the polynucleotide is immobilised on a solid surface. 

The present invention clearly encompasses nucleic acid-based methods and protein- 
based methods for diagnosing cancer in humans and other mammals. 

15 Accordingly, in a related embodiment, the present invention provides a method of 
detecting an ovarian cancer-associated polypeptide in a biological sample the method 
comprising contacting the biological sample with an antibody that binds specifically to an 
ovarian cancer-associated polypeptide in the biological sample, the polypeptide being 
encoded by a polynucleotide that selectively hybridizes to a sequence at least 80% 

20 identical to a sequence as shown in Tables 1-3. 

Preferably the percentage identity to a sequence disclosed in any one of Tables 1-3 is at 
least about 85% or 90% or 95%, arid still more preferably at least about 98% or 99%. 

25 In a preferred embodiment, the present invention provides a method of diagnosing an 
ovarian cancer in a human or animal subject being tested said method comprising 
contacting a biological sample from said subject being tested with an antibody for a time 
and under conditions sufficient for an antigen-antibody complex to form and then 
detecting the complex wherein a modified level of the antigen-antibody complex for the 

30 subject being tested compared to the amount of the antigen-antibody complex formed for 
a control subject not having ovarian cancer indicates that the subject being tested has an 
ovarian cancer, and wherein said antibody binds to a polypeptide comprising an amino 
acid sequence comprising at least about 10 contiguous amino acid residues of a 
sequence having at least about 80% identity to a sequence selected from the group 

35 consisting of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 
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36, 38, 40, 42, 44, 47, 49, 51, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82 
and 84. 

In a preferred embodiment, the present invention provides a method of diagnosing an 
5 ovarian cancer in a human or animal subject being tested said method comprising 
contacting a biological sample from said subject being tested with an antibody for a time 
and under conditions sufficient for an antigen-antibody complex to form and then 
detecting the complex wherein a modified level of the antigen-antibody complex for the 
subject being tested compared to.the amount of the antigen-antibody complex formed for 

10 a control subject not having ovarian cancer indicates that the subject being tested has ah 
ovarian cancer, and wherein said antibody binds to a polypeptide comprising an amino 
acid sequence comprising at least about 10 contiguous amino acid residues of a 
sequence having at least about 80% identity to a sequence selected from the group 
consisting of SEQ ID NOs: 2, 6, 8, 10, 12, 14, 16, 18, 22, 24, 26, 28, 30, 32, 34, 36, 38, 

15 40, 42, 47, 49, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82 and 84. 

In one embodiment an elevated, enhanced or increased level of expression of the 
antigen-antibody complex is detected. In accordance with this embodiment, the present 
invention provides a method of diagnosing an ovarian cancer in a human or animal 

20 subject being tested said method comprising contacting a biological sample from said 
subject being tested with an antibody for a time and under conditions sufficient for an 
antigen-antibody complex to form and then detecting the complex wherein an enhanced 
level of the antigen-antibody complex for the subject being tested compared to the 
amount of the antigen-antibody complex formed for a control subject not having ovarian 

25 cancer indicates that the subject being tested has an ovarian cancer, and wherein said 
antibody binds to a polypeptide comprising an amino acid sequence comprising at least 
about 10 contiguous amino acid residues of a polypeptide encoded by a nucleic acid set 
forth in Table 1 or 2 other than a nucleic acid having an Accession Number selected from 
the group consisting of NMJ)22117, NMJ)05460, NMJ)02387, AI745249 and AI694200. 

30 

In a preferred embodiment, the present invention provides a method of diagnosing an 
ovarian cancer in a human or animal subject being tested said method comprising 
contacting a biological sample from said subject being tested with an antibody for a time 
and under conditions sufficient for an antigen-antibody complex to form and then 
35 detecting the complex wherein an enhanced level of the antigen-antibody complex for the 
subject being tested compared to the amount of the antigen-antibody complex formed for 
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a control subject not having ovarian cancer indicates that the subject being tested has an 
ovarian cancer, and wherein said antibody binds to a polypeptide comprising an amino 
acid sequence comprising at least about 10 contiguous amino acid residues of a 
sequence having at least about 80% identity to a sequence selected from the group 
consisting of SEQ ID NOs: 8, 10, 12, 14. 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 
40, 42, 44, 47, 49, 51, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82 and 84. 

In an alternative preferred embodiment, a reduced level of a diagnostic marker is 
indicative of ovarian cancer. In accordance with this embodiment, the present invention 
provides a method of diagnosing an ovarian cancer in a human or animal subject being 
tested said method comprising contacting a biological sample from said subject being 
tested with an antibody for a time and under conditions sufficient for an antigen-antibody 
complex to form and then detecting the complex wherein a reduced level of the antigen- 
antibody complex for the subject being tested compared to the amount of the antigen- 
antibody complex formed for a control subject not having ovarian cancer indicates that 
the subject being tested has an ovarian cancer, and wherein said antibody binds to a 
polypeptide comprising an amino acid sequence comprising at least about 10 contiguous 
amino acid residues of a polypeptide encoded by a nucleic acid set forth in Table 1 and 
having an Accession Number selected from the group consisting of NM_022117, 
NM_005460, NM_002387, AI745249 and AI694200. 

In a preferred embodiment, the present invention provides a method of diagnosing an 
ovarian cancer in a human or animal subject being tested said method comprising 
contacting a biological sample from said subject being tested with an antibody for a time 
and under conditions sufficient for an antigen-antibody complex to form and then 
detecting the complex wherein a reduced level of the antigen-antibody complex for the 
subject being tested compared to the amount of the antigen-antibody complex formed for 
a control subject not having ovarian cancer indicates that the subject being tested has an 
ovarian cancer, and wherein said antibody binds to a polypeptide comprising an amino 
acid sequence comprising at least about 10 contiguous amino acid residues of a 
sequence having at least about 80% identity to a sequence selected from the group 
consisting of SEQ ID NOs: 2, 4, and 6. 

Preferably, the ovarian cancer that is diagnosed according to the present invention is an 
epithelial ovarian cancer, such as, for example, serous ovarian cancer or mucinous 
ovarian cancer. 
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In an alternative preferred embodiment, the present invention provides a method of 
diagnosing a serous ovarian cancer in a human or animal subject being tested said 
method comprising contacting a biological sample from said subject being tested with an 
5 antibody for a time and under conditions sufficient for an antigen-antibody complex to 
form and then detecting the complex wherein a modified level of the antigen-antibody 
complex for the subject being tested compared to the amount of the antigen-antibody 
complex formed for a control subject not having ovarian cancer indicates that the subject 
being tested has a serous ovarian cancer, and wherein said antibody binds to a 
10 polypeptide comprising an amino acid sequence comprising at least about 10 contiguous 
amino acid residues of a polypeptide encoded by a nucleic acid set forth in Table 2 or as 
set forth in Table 1 and having an Accession Number selected from the group consisting 
of: U62801 , D49441 , X51 630, And AB01 8305. 

15 In a further alternative preferred embodiment, the present inventbn provides a method of 
diagnosing a mucinous ovarian cancer in a human or animal subject being tested said 
method comprising contacting a biological sample from said subject being tested with an 
antibody for a time and under conditions sufficient for an antigen-antibody complex to 
form and then detecting the complex wherein a reduced level of the antigen-antibody 

20 complex for the subject being tested compared to the amount of the antigen-antibody 
complex formed for a control subject not having ovarian cancer indicates that the subject 
being tested has a mucinous ovarian cancer, and wherein said antibody binds to a 
polypeptide comprising an amino acid sequence comprising at least about 10 contiguous 
amino acid residues of a polypeptide encoded by a nucleic acid set forth in Table 1 and 

25 having an Accession Number selected from the group consisting of: NM_006149, 
AA315933, U47732, NM_005588, AW503395, NMJ)04063, AI073913, AI928445, 
NM_022454, W40460, AA1 32961 and AF111856. 

In a preferred embodiment, the present invention provides a method of diagnosing a 
30 mucinous ovarian cancer in a human or animal subject being tested said method 
comprising contacting a biological sample from said subject being tested with an 
antibody for a time and under conditions sufficient for an antigen-antibody complex to 
form and then detecting the complex wherein an enhanced level of the antigen-antibody 
complex for the subject being tested compared to the amount of the antigen-antibody 
35 complex formed for a control subject not having ovarian cancer indicates that the subject 
being tested has a mucinous ovarian cancer, and wherein said antibody binds to a 
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polypeptide comprising an amino acid sequence comprising at least about 10 contiguous 
amino acid residues of a sequence having at least about 80% identity to SEQ ID NO: 58 
or 60 or 62. 

5 In a further related embodiment, the present invention provides a method of detecting an 
ovarian cancer-associated antibody in a biological sample the method comprising 
contacting the biological sample with a polypeptide encoded by a polynucleotide that 
selectively hybridizes to a sequence at least 80% identical to a sequence as shown in 
Tables 1-3, wherein the polypeptide specifically binds to the ovarian cancer-associated 
10 antibody. 

Preferably, in the above methods, the biological sample is contacted with a plurality of 
the polynucleotides, polypeptides or antibodies referred to above. 

15 In a particularly preferred embodiment, the present invention provides an antibody-based 
mulptiplex assay for determing the likelihood of survivial of a subject from an ovarian 
cancer. In one embodiment, the invention provides a method of determining the 
likelihood of survival of a subject suffering from a serous ovarian cancer, said method 
comprising contacting a biological sample from said subject being tested with at least two 

20 antibodies for a time and under conditions sufficient for antigen-antibody complexes to 
form and then detecting the complexes wherein an enhanced level of the antigen- 
antibody complexes for the subject being tested compared to the amount of the antigen- 
antibody complexes formed for a control subject not having ovarian cancer indicates that 
the subject being tested has a poor probability of survival, and wherein one antibody 

25 binds to an sFRP polypeptide comprising the amino acid sequence set forth in SEQ ID 
NO: 72 and wherein one antibody binds to a SOCS3 polypeptide comprising the amino 
acid sequence set forth in SEQ ID NO: 74. 

The present invention is not to be limited by the source or nature of the biological 
30 ' sample. In one embodiment, the biological sample is from a patient undergoing a 
therapeutic regimen to treat ovarian cancer. In an alternative preferred embodiment, the 
biological sample is from a patient suspected of having ovarian cancer. 

In addition to providing up-regulated and down-regulated genes, the list of genes and 
35 proteins exemplified herein by Table 1 were identified by a statistical analysis as outlined 
in the examples which gave a P-value, eg., by comparison of expression to 
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clinicopathological parameters for disease recurrence or patient survival. Accordingly, 
the present invention is particularly useful for prognostic applications, in particular for 
assessing the medium-to-long term survival of a subject having an ovarian cancer, or 
alternatively or in addition, for assessing the likelihood of disease recurrence. 

Accordingly, a further aspect of the present invention provides a method of monitoring 
the efficacy of a therapeutic treatment of ovarian cancer, the method comprising: 

(i) providing a biological sample from a patient undergoing the therapeutic 
treatment; and 

(ii) determining the level of a ovarian cancer-associated transcript in the 
biological sample by contacting the biological sample with a polynucleotide 
that selectively hybridizes to a sequence having at least about 80% identity 
to a sequence as shown in any one of Tables 1-3, thereby monitoring the 
efficacy of the therapy. 

Preferably the method further comprises comparing the level of the ovarian 
cancer-associated transcript to a level of the ovarian cancer-associated transcript in a 
biological sample from the patient prior to, or earlier in, the therapeutic treatment 

In a related embodiment, the present invention provides a method of monitoring the 
efficacy of a therapeutic treatment of ovarian cancer, the method comprising : 

(i) providing a biological sample from a patient undergoing the. therapeutic 
treatment; and 

(ii) determining the level of a ovarian cancer-associated antibody in the 
biological sample by contacting the biological sample with a polypeptide 
encoded by a polynucleotide that selectively hybridizes to a sequence at 
least 80% identical to a sequence as shown in Tables 1-3, wherein the 
polypeptide specifically binds to the ovarian cancer-associated antibody, 
thereby monitoring the efficacy of the therapy. 



Preferably the method further comprises comparing the level of the ovarian 
cancer-associated antibody to a level of the ovarian cancer-associated antibody in a 
biological sample from the patient prior to, or earlier in, the therapeutic treatment. 
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In a further related embodiment, the present invention provides a method of monitoring 
the efficacy of a therapeutic treatment of ovarian cancer, the method comprising : 

(i) providing a biological sample from a patient undergoing the therapeutic 

treatment; and 

5 (ii) determining the level of a ovarian cancer-associated polypeptide in the 

biological sample by contacting the biological sample with an antibody, wherein 
the antibody specifically binds to a polypeptide encoded by a polynucleotide that 
selectively hybridizes to a sequence at least 80% identical to a sequence as 
shown in Tables 1-3, thereby monitoring the efficacy of the therapy. 

10 

Preferably the method further comprises comparing the level of the ovarian 
cancer-associated polypeptide to a level of the ovarian cancer-associated polypeptide in 
a biological sample from the patient prior to, or earlier in, the therapeutic treatment. 

15 It will also be apparent from the following preferred embodiments, that the expression of 
certain genes listed in Table 1 and Table 3 is statistically correlated with survival and 
death of patients having ovarian cancer, wherein a low P value indicates an enhanced 
likelihood that a patient having altered expression of the gene will die from the cancer. 

20 Accordingly, in one embodiment, the present invention provides a method of determining 
the likelihood of survival of a subject suffering from an ovarian cancer, said method 
comprising contacting a biological sample from said subject being tested with a nucleic 
acid probe for a time and under conditions sufficient for hybridization to occur and then 
detecting the hybridization wherein an elevated level of hybridization of the probe for the 

25 subject being tested compared to the hybridization obtained for a control subject not 
having ovarian cancer indicates that the subject being tested has a poor probability of 
survival, and wherein said nucleic acid probe comprises a sequence selected from the 
group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a nucleic 
30 acid set forth in Table 1 and having an Accession Number selected from the group 

consisting of: NM_003014, AA046217, NMJ)15902, T83882, AB040888, 
AA628980, AI623351, AW614420, AA243499, AF251237, AI970797, AF145713, 
X78565, T97307, BE243845, AW068302, AL133561, BE313555, X07820, 
AI973016, AF084545, U41518, Z11894, AW138190, BE086548, W47196, 
35 AI796870, X02761, AW968613, AW972565, AF045229, AW953853, U52426,. 

F06700, AI798863, H52761, BE546947, AU076643, U20536, AA581602, 
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AJ245210, X65965, AI806770, BE386490. AW581992, U77534, AL034417, 
L10343, AW518944, W28729, AI640160, U11862, AW295980, X59135, 
BE466173, AI354722, M90464, AA829286, AI333771, BE465867, NM_014992, 
BE616902, AA430373, R27430, BE387335, AW264102, AW952323, AA088177, 
5 BE614567, AL079658, NM_002776, BE261944, NM_006379, A1002238, X81789, 

NM_002122, AB001914, AA311919, AI381750, AA292998, BE439580, AI677897, 
N72403, BE003054, AL035588, AI080491, AW770994, H24177, AF1 46761, 
NM_001 955, AI680737, AI752666, AA505445, BE246649, and NM_003955; 

(ii) a sequence that hybridizes under, at least low stringency hybridization conditions 
10 to at least about 20 contiguous nucleotides from a nucleic acid set forth in Table 1 

and having an Accession Number selected from the group consisting of: 
NM_003014, AA046217, NM_015902, T83882, AB040888, AA628980, AI623351, 
AW614420, AA243499, AF251237, AJ970797, AF145713, X78565, T97307, 
BE243845, AW068302, AL1 33561, BE313555, X07820, AI973016, AF084545, 

15 U41518, Z11894, AW138190, BE086548, W47196, AI796870, X02761, 

AW968613, AW972565, AF045229. AW953853, U52426, F06700, AI798863, 
H52761, BE546947, AU076643, U20536, AA581602, AJ245210, X65965, 
AI806770, BE386490, AW581992, U77534, AL034417, L10343, AW518944, 
W28729, AI640160, U11862, AW295980, X59135, BE466173, AI354722, 

20 M90464, AA829286, A1333771, BE465867, NM_014992, BE616902, AA430373, 

R27430, BE387335, AW264102, AW952323, AA088177, BE614567, AL079658, 
NM_002776, BE261944, NM_006379, AI002238, X81789, NM_002122, 
AB001914, AA311919, AI381750, AA292998, BE439580, AI677897, N72403, 
BE003054, AL035588, AJ080491, AW770994, H24177, AF146761, NM_001955, 

25 AI680737, AI752666, AA505445, BE246649, and NM_003955; 

(iii) a sequence that is at least about 80% identical to (i) or (ii); 

(iv) a sequence that encodes a polypeptide encoded by a nucleic acid set forth in 
Table 1 and having an Accession Number selected from the group consisting of: 
NM_003014, AA046217, NM_015902, T83882, AB040888, AA628980, AI623351, 

30 AW614420, AA243499, AF251237, AI970797, AF145713, X78565, T97307, 

BE243845, AW068302, AL133561, BE313555, X07820, AI973016, AF084545, 
U41518, Z11894, AW138190, BE086548, W47196, AI796870, X02761, 
AW968613. AW972565, AF045229, AW953853, U52426, F06700, AI798863, 
H52761, BE546947, AU076643, U20536, AA581602, AJ245210, X65965, 

35 AI806770, BE386490, AW581992, U77534, AL034417, L10343, AW518944, 

W28729, AI640160, U11862, AW295980, X59135, BE466173, AI354722, 
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M90464, AA829286, AI333771, BE465867, NMJJ14992, BE616902, AA430373, 
R27430, BE387335, AW264102, AW952323, AA088177, BE614567, AL079658, 
NM_002776, BE261944, NM_006379, AI002238, X81789, NMJD02122, 
AB001914, AA311919, AI381750, AA292998, BE439580, AI677897, N72403, 
5 BE003054, AL035588, AI080491, AW770994, H24177, AF1 46761, NMJ)01955, 

AI680737, AI752666, AA505445, BE246649, and NM J)03955; and 
(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (Hi) or (iv). 

10 In a preferred embodiment, the present invention provides a method of determining the 
likelihood of survival of a subject suffering from an ovarian cancer, said method 
comprising contacting a biological sample from said subject being tested with a nucleic 
acid probe for a time and under conditions sufficient for hybridization to occur and then 
detecting the hybridization wherein an elevated level of hybridization of the probe for the 

15 subject being tested compared to the hybridization obtained for a control subject not 
having ovarian cancer indicates that the subject being tested has a poor probability of 
survival, and wherein said nucleic acid probe comprises a sequence selected from the 
group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a sequence 
20 selected from the group consisting of SEQ ID NOs: 63, 65, 67, 69, 71, 73, 75, 77, 

79, 81 and 83; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a sequence selected from the 
group consisting of SEQ ID NOs: 63, 65, 67, 69, 71, 73, 75, 77, 79, 81 and 83; 

25 (iii) a sequence that is at least about 80% identical to a sequence selected from the 
group consisting of SEQ ID NOs: 63, 65, 67, 69, 71, 73, 75, 77, 79, 81 and 83; 

(iv) a sequence that encodes an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 64, 66, 68, 70, 72, 74, 76, 78, 80, 82 and 84; and 

(v) a sequence that is complementary to (i) or (ii) or (iii) or (iv). 

30 

In an alternative preferred embodiment, the present invention provides a method of 
determining the likelihood of survival of a subject suffering from an ovarian cancer, said 
method comprising contacting a biological sample from said subject being tested with an 
antibody for a time and under conditions sufficient for an antigen-antibody complex to 
35 form and then detecting the complex wherein an enhanced level of the antigen-antibody 
complex for the subject being tested compared to the amount of the antigen-antibody 
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complex formed for a control subject not having ovarian cancer indicates that the subject 
being tested has has a poor probability of survival, and wherein said antibody binds to a 
polypeptide comprising an amino add sequence comprising at least about 10 contiguous 
amino acid residues of a sequence encoded by a nucleic acid set forth in Table 1 and 
5 having an Accession Number selected from the group consisting of: NM__003014, 
AA046217, NM_015902, T83882, AB040888, AA628980, AI623351. AW614420, 
AA243499, AF251237, AI970797, AF145713, X78565, T97307, BE243845, AW068302, 
AL133561, BE313555, X07820, AI973016, AF084545, U41518, Z11894, AW138190, 
BE086548, W47196, AI796870, X02761, AW968613, AW972565, AF045229, 

10 AW953853, U52426, F06700, AI798863, H52761, BE546947, AU076643, U20536, 
AA581602, AJ245210, X65965, AI806770, BE386490, AW581992, U77534, AL034417, 
L10343, AW518944, W28729, AI640160, U11862, AW295980, X59135, BE466173, 
AI354722, M90464, AA829286; AI333771, BE465867, NM_014992, BE616902, 
AA430373, R27430, BE387335, AW264102, AW952323, AA088177, BE614567, 

15 AL079658, NM_002776, BE261944, NM_006379, AI002238, X81789, NM_002122, 
AB001914, AA311919, AI381750, AA292998, BE439580, AI677897, N72403, 
BE003054, AL035588, A1080491, AW770994, H24177, AF1 46761, NM_001955, 
AI680737, AI752666, AA505445, BE246649, and NM_003955. 

20 In an alternative preferred embodiment, the present invention provides a method of 
determining the likelihood of survival of a subject suffering from an ovarian cancer, said 
method comprising contacting a biological sample from said subject being tested with an 
antibody for a time and under conditions sufficient for an antigen : antibody complex to 
form and then detecting the complex wherein an enhanced level of the antigen-antibody 

25 complex for the subject being tested compared to the amount of. the antigen-antibody 
complex formed for a control subject not having ovarian cancer indicates that the subject 
being tested has has a poor probability of survival, and wherein said antibody binds to a 
polypeptide comprising an amino add sequence comprising at least about 10 contiguous 
amino acid residues of a sequence having at least about 80% identity to a sequence 

30 selected from the group consisting of SEQ ID NOs: 64, 66, 68, 70, 72, 74, 76, 78, 80, 82 
and 84. 

In a particularly preferred embodiment, the present invention provides a marker for 
determining the likelihood of a subject surviving from serous cancer. In accordance with 
35 this embodiment of the invention, there is provided a method of determining the 
likelihood of survival of a subject suffering from a serous ovarian cancer, said method 
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comprising contacting a biological sample from said subject being tested with a nucleic 
acid probe for a time and under conditions sufficient for hybridization to occur and then 
detecting the hybridization wherein an elevated level of hybridization of the probe for the 
subject being tested compared to the hybridization obtained for a control subject not 
5 having ovarian cancer indicates that the subject being tested has a poor probability of 
survival, and wherein said nucleic acid probe comprises a sequence selected from the 
group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a nucleic 

acid comprising the nucleotide sequence set forth in SEQ ID NO: 71 or 73; 
10 (ii) a sequence that hybridizes under at least low stringency hybridization conditions 

to at least about 20 contiguous nucleotides from a nucleic acid comprising the 

nucleotide sequence set forth in SEQ ID NO: 71 or 73; 
(iii) a sequence that is at least about 80% identical to (i) or (ii) and encoding an sFRP 

protein or a SOCS3 protein; 
15 (iv) a sequence that encodes a polypeptide comprising the amino acid sequence set 

forth in SEQ ID NO: 72 or 74; and 
(v) a sequence that is complementary to any one of the sequences set forth in (i) or 

(ii) or (iii) or (iv). 

20 In an alternative preferred embodiment, the present invention provides a method of 
determining the likelihood of survival of a subject suffering from a serous ovarian cancer, 
said method comprising contacting a biological sample from said subject being tested 
with an antibody for a time and under conditions sufficient for an antigen-antibody 
complex to form and then detecting the complex wherein an enhanced level of the 

25 antigen-antibody complex for the subject being tested compared to the amount of the 
antigen-antibody complex formed for a control subject not having ovarian cancer 
indicates that the subject being tested has a poor probability of survival, and wherein said 
antibody binds to an sFRP polypeptide comprising the amino acid sequence set forth in 
SEQ ID NO: 72 or a SOCS3 polypeptide comprising the amino acid sequence set forth in 

30 SEQ ID NO: 74 or. 

It will also be apparent from the following preferred embodiments, that the expression of 
certain genes listed in Table 1 and Table 3 is statistically correlated with recurrence of 
ovarian cancer, wherein a low P value indicates an enhanced likelihood that a patient 
35 having altered expression of the gene will experience recurrence of the disease. 
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In yet another preferred embodiment, the present invention provides a method of 
determining the likelihood that a subject will suffer from a recurrence of an ovarian 
cancer, said method comprising contacting a biological sample from said subject being 
tested with a nucleic acid probe for a time and under conditions sufficient for 
hybridization to occur and then detecting the hybridization wherein an elevated level of 
hybridization of the probe for the subject being tested compared to the hybridization 
obtained for a control subject not having ovarian cancer indicates that the subject being 
tested has a high probability of recurrence, and wherein said nucleic acid probe 
comprises a sequence selected from the group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a nucleic 
acid set forth in Table 1 and having an Accession Number selected from the group 
consisting of: M86849, AW963419, BE298665, AK000637, BE077546, T97307, 
R24601, BE090176, AA393907, W28729, BE313754, AW673081, AA356694, 
L08239, BE397649, NM_012317, NM_000947, AJ250562, AL040183, BE207573, 
BE564162, BE439580, AW067800, AA569756, AW138190, AF126245, L10343, 
NM_002514, AI863735, NM_005397, W26391, H15474, U51166, AA243499, 
AW408807, AI738719, AB040888, BE313077, AI677897, C14898, AI821730, 
AF007393, H65423, N46243, AA095971, U20350, NM_005756, D19589, 
AW957446, AW294647, BE159718, AI888490, AA022569, BE1 47740, AI798863, 
BE464341, AL080235, AI557212, X75208, AA628980, BE242587, NM_005512, 
AW953853, AU076611, AW968613, AL353944, BE614149, AA292998, H12912, 
AA188763, AK000596. AI970797, AW519204, Z42387, AF145713, AA972412, 
AK001564, AW959861, BE313555, W25005, AI193356, AF111106, A1130740, 
AA985190. BE221880, AF084545, R26584, AW247380, AA364261. U25849, 
AF262992, AW342140, AL133572, AI497778, AI745379, U51712, AW375974, 
AF251237, NM_000636, AA1 30986, AA216363, AA628980, AA811657, 
AA897108, AB040888, AF212225, AI089575, AI282028, AI368826, AI718702, 
AI827248, AK002039, AL1 09791, AW090198, AW296454, AW445034, 
AW462948, AW470411, AW885727, AW970859, AW979189, BE165866, 
BE175582, BE242587, BE271927, BE439580, BE464016, D63216, F34856, 
M83822, N33937, N49068, N51357, N80486, NM_000954, NM_005756, 
NM_016652, R26584. R31178, W05391, W25005, W45393, W68815, X65965, 
X76732andZ45051, 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a nucleic acid set forth in Table 1 
and having an Accession Number selected from the group consisting of: M86849, 
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AW963419, BE298665, AK000637, BE077546, T97307, R24601, BE090176, 
AA393907, W28729, BE313754, AW673081, AA356694, L08239, BE397649, 
NM_012317, NM_000947,. AJ250562. AL040183, BE207573, BE564162, 
BE439580, AW067800, AA569756, AW138190, AF1 26245, L10343, NMJJ02514, 
5 AI863735, NM..005397, W26391, H15474, U51166, AA243499, AW408807, 

AI738719, AB040888, BE313077, AI677897, C14898, AI821730, AF007393, 
H65423. N46243, AA095971, U20350, NM_005756, D19589, AW957446, 
AW294647, BE159718, AI888490, AA022569, BE1 47740, AI798863, BE464341, 
AL080235, AI557212, X75208, AA628980, BE242587, NM_005512, AW953853, 

10 AU076611, AW968613, AL353944, BE614149, AA292998, H12912, AA188763, 

AK000596, AI970797, AW519204, Z42387, AF145713, AA972412, AK001564, 
AW959861, BE313555, W25005, AI193356, AF111106, AI130740, AA985190, 
BE221880, AF084545, R26584, AW247380, AA364261, U25849, AF262992, 
AW342140, AL133572, AI497778, AI745379, U51712, AW375974, AF251237, 

15 NM_000636, AA130986, AA216363, AA628980, AA811657, AA897108, 

AB040888, AF212225, AI089575, AJ282028, AI368826, AI718702, AI827248, 
AK002039, AL1 09791, AW090198, AW296454, AW445034, AW452948, 
AW470411, AW885727, AW970859, AW979189, BE165866, BE1 75582, 
BE242587, BE271927, BE439580. BE464016, D63216, F34856, M83822, 

20 N33937, N49068, N51357, N80486, NM_000954, NM_005756, NM_016652, 

R26584, R31178, W05391, W25005, W45393, W68815, X65965, X76732 and 
Z45051; 

(iii) a sequence that is at least about 80% identical to (i) or (ii); 

(iv) a sequence that encodes a polypeptide encoded by a nucleic acid set forth in 
25 Table 1 and having an Accession Number selected from the group consisting 

of:M86849, AW963419, BE298665, AK000637, BE077546, T97307, R24601, 
BE090176, AA393907, W28729, BE313754, AW673081, AA356694, L08239, 
BE397649, NM_012317, NM_000947, AJ250562, AL040183, BE207573. 
BE564162, BE439580, AW067800, AA569756, AW138190, AF126245, L10343, 

30 NM_002514, AI863735, NM_005397, W26391, H15474, U51166, AA243499, 

AW408807, AI738719, AB040888, BE313077, AI677897, C14898, AI821730, 
AF007393, H65423. N46243, AA095971, U20350, NM_005756, D19589. 
AW957446, AW294647, BE159718, AI888490, AA022569, BE147740, AI798863, 
BE464341, AL080235, AI557212, X75208, AA628980, BE242587, NM_005512, 

35 AW953853, AU076611, AW968613, AL353944, BE614149, AA292998, H12912, 

AA188763, AK000596, AI970797, AW519204, Z42387, AF145713, AA972412, 
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AK001564, AW959861, BE313555, W25005, AI193356, AF111106, AI130740, 
AA985190, BE221880, AF084545, R26584, AW247380, AA364261, U25849, 
AF262992, AW342140, AL1 33572, AI497778, AI745379, U51712, AW375974, 
AF251237, NM_000636, AA1.30986, AA216363, AA628980, AA811657, 
AA897108, AB040888, AF212225, AI089575, AI282028, AI368826, AI718702, 
AI827248, AK002039, AL109791, AW090198, AW296454, AW445034, 
AW452948, AW470411, AW885727, AW970859, AW979189, BE165866, 
BE1 75582, BE242587, BE271927, BE439580, BE464016, D63216, F34856, 
M83822, N33937, N49068, N51357, N80486, NM_000954, NM_005756, 
NM_016652, R26584, R31178, W05391, W25005, W45393, W68815, X65965, 



X76732andZ45051;and 
(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ji) or (Hi) or (iv). 

15 In an alternative preferred embodiment, the present invention provides a method of 
determining the likelihood that a subject will suffer from a recurrence of an ovarian 
cancer, said method comprising contacting a biological sample from said subject being 
tested with an antibody for a time and under conditions sufficient for an antigen-antibody 
complex to form and then detecting the complex wherein an enhanced level of the 

20 antigen-antibody complex for the subject being tested compared to the amount of the 
antigen-antibody complex formed for a control subject not having ovarian cancer 
indicates that the subject being tested has a high probability of recurrence, and wherein 
said antibody binds to a polypeptide comprising an amino acid sequence comprising at 
least about 10 contiguous amino acid residues of a sequence encoded by a nucleic acid 

25 set forth in Table 1 and having an Accession Number selected from the group consisting 
of: M86849, AW963419, BE298665, AK000637, BE077546, T97307, R24601, 
BE090176, AA393907, W28729, BE313754, AW673081, AA356694, L08239, 
BE397649. NM_012317, NM_000947, AJ250562, AL040183, BE207573, BE564162, 
BE439580, AW067800, AA569756, AW138190, AF126245, L10343, NM_002514, 

30 AI863735, NM_005397, W26391, H15474, U51166, AA243499, AW408807, AI738719, 
AB040888, BE313077, AI677897, C14898, AI821730, AF007393, H65423, N46243, 
AA095971, U20350, NM_005756, D19589, AW957446, AW294647, BE159718, 
AI888490, AA022569, BE1 47740, AI798863, BE464341, AL080235, AI557212, X75208, 
AA628980, BE242587, NM_005512, AW953853, AU076611, AW968613, AL353944, 

35 BE614149, AA292998, H12912, AA188763, AK000596, AI970797, AW519204, Z42387, 
AF145713, AA972412, AK001564, AW959861, BE313555, W25005, AI193356, 
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AF111106, AI130740, AA985190, BE221880, AF084545, R26584, AW247380, 
AA364261, U25849, AF262992, AW342140, AL133572, AI497778, AI745379, U51712, 
AW375974, AF251237, NMJJ00636, AA130986, AA216363, AA628980, AA811657, 
AA897108, AB040888, AF212225, AI089575, AI282028, AI368826, AI718702, 
5 AI827248, AK002039, AL1 09791, AW090198, AW296454, AW445034, AW452948, 
AW470411, AW885727, AW970859, AW979189, BE165866, BE175582, BE242587, 
BE271927, BE439580, BE464016, D63216, F34856, M83822, N33937, N49068, 
N51357, N80486, NM_000954, NMJ)05756, NM__016652, R26584, R31178, W05391, 
W25005, W45393, W6881 5, X65965, X76732 and Z45051 . 

10 

The recurrence of ovarian cancer is a clinical recurrence as determined by the presence 
of one or more clinical symptoms of an ovarian cancer, such as, for example, a 
metastases, or alternatively, as determined in a biochemical test, immunological test or 
serological test such as, for example, a cross-reactivity in a biological sample to a CA125 
15 antibody. 

Preferably, the recurrence is capable of being detected at least about 2 years from 
treatment, more preferably about 2-3 years from treatment, and even more preferably 
about 4 or 5 or 1 0 years from treatment 

20 

Preferably, in the above diagnostic and/or prognostic methods, the biological sample is 
contacted with a plurality of the nucleic acids and/or polypeptides and/or antibodies 
referred to above. In a particularly preferred embodiment, mulpiplex assays are 
performed to detect enhanced expression at least of sFRP4 and SOC3 at the protein 
25 level (eg., using antigen-based or antibody-based assays) or at the mRNA level (eg., by 
detecting elevated levels of mRNA transcripts). 

A further embodiment of the present invention provides a method of diagnosing epithelial 
ovarian cancer by detecting aberrant methylation of a promoter that regulates expression 

30 of a tumor suppressor gene eg., MCC. In particular, the present invention contemplates 
the detection of hypermethylation of the promoter of a tumor suppressor gene. Without 
being bound by any theory or mode of action, such hypermethylation leads to gene . 
inactivation, thereby reducing expression fo the tumor suppressor gene and permitting 
oncogenesis. In one preferred embodiment, the present invention provides a method of 

35 diagnosing an ovarian cancer in a human or animal subject being tested said method 
comprising determining aberrant methylation in a promoter sequence that regulates 
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expression of a tumor suppressor gene in a biological sample from said subject, 
compared to the methylation of the promoter in nucleic acid obtained for a control subject 
not having ovarian cancer wherein said aberrant methylation indicates that the subject 
being tested has an ovarian ovarian cancer. 

5 

In a further aspect, the present invention provides a method for identifying a compound 
that modulates an ovarian cancer-associated polypeptide, the method comprising : 

. (i) contacting the compound with a ovarian cancer-associated polypeptide, the 
polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence 
10 at least 80% identical to a sequence as shown in Tables 1-3; and 

(ii) determining the functional effect of the compound upon the polypeptide. 

The functional effect may, for example, be a physical effect or a chemical effect. In one 
embodiment, the functional effect is determined by measuring ligand binding to the 
15 polypeptide. In a particular embodiment the polypeptide is expressed in a eukaryotic 
host cell or cell membrane. Preferably the polypeptide is recombinant. 

In another aspect, the present invention provides a method of inhibiting proliferation of a 
ovarian tumour cell, which method comprises contacting said cell with a compound 
20 identified using the method supra for identifying a compound that modulates an ovarian 
cancer-associated polypeptide. 

In a further aspect, the present invention provides a method of inhibiting proliferation of a 
ovarian cancer-associated cell to treat ovarian cancer in a patient, the method 
25 comprising the step of administering to the patient a therapeutically effective amount of a 
compound identified using the method supra for identifying a compound that modulates 
an ovarian cancer-associated polypeptide. 

In a further aspect, the present invention provides a drug screening assay comprising : 
30 (i) administering a test compound to a mammal having ovarian cancer or a cell 

isolated therefrom; 

(ii) comparing the level of gene expression of a polynucleotide that selectively 
hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 
1-3 in a treated cell or mammal with the level of gene expression of the 
35 polynucleotide in a control cell or mammal, wherein a test compound that 
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modulates the level of expression of the polynucleotide is a candidate for the 
treatment of ovarian cancer. 

Typically, the control is a mammal with ovarian cancer or a cell therefrom that has not 
5 been treated with the test compound. Alternatively, the control is a normal cell or 
mammal. 

The present invention also provides a method for treating a mammal having ovarian 
cancer comprising administering a compound identified the drug screening method 
10 supra. 

In a further aspect, the present invention provides a pharmaceutical composition for use 
in treating a mammal having ovarian cancer, the composition comprising a compound 
identified the screening method supra for identifying a compound that modulates an 
15 ovarian cancer-associated polypeptide, or alternatively, using the drug screening method 
supra, and a physiologically acceptable carrier or diluent. 

In a further aspect, the present invention provides an assay device, preferably for use in 
the diagnosis or prognosis of ovarian cancer, said device comprising a plurality of 
20 polynucleotides immobilized to a solid phase, wherein each of said polnucleotldes 
consists of a gene as listed in any one of Tables 1-3. Preferably, the solid phase is a 
substantially planar chip. 

In a related embodiment, the present invention provides an assay device, preferably for 
25 use in the diagnosis or prognosis of ovarian cancer, said device comprising a plurality of 
different antibodies immobilized to a solid phase, wherein each of said antibodies binds 
to a polypeptide listed in Tables 1-3. Preferably, the solid phase is a substantially planar 
chip. 

30 Preferably, the assay device supra is used in a method of diagnosis or prognosis as 
described herein. 

Alternatively, the assay device is used to identify modulatory compounds of the 
expression of one or more genes/proteins listed in any one of Tables 1-3. 

35 
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In a further aspect, the present invention provides a non-human transgenic animal which 
is transgenic by virtue of comprising a gene set forth in any one of Tables 1-3 and, in 
particular, to the use of any such transgenic animal in the performance of a diagnostic or 
prognostic method of the invention as transgenic "knock-out" animals that have disrupted 
5 expression of a gene as set forth in any one of Tables 1 -3. 

In a further aspect, the present invention provides an isolated polynucleotide selected 
from the group consisting of; 

(a) polynucleotides comprising a nucleotide sequence as shown in Tables 1-3, 
10 or the complement thereof; 

(b) polynucleotides comprising a nucleotide sequence capable of selectively 
hybridizing to a nucleotide sequence as shown in Tables 1-3; 

(c) polynucleotides comprising a nucleotide sequence capable of selectively 
hybridizing to the complement of a nucleotide sequence as shown in Tables 1-3; 

15 and 

(d) polynucleotides comprising a polynucleotide sequence which is degenerate 
as a result of the genetic code to the polynucleotides defined in (a), (b) or (c). 

Preferred polynucleotides comprise a polynucleotide sequence as shown in Tables 1-3 
20 or a sequence having at least 80% homology thereto. 

Preferably, the isolated polynucleotide is used for the diagnosis or prognosis of ovarian 
cancer, more preferably by a method as described herein. . In a particularly preferred 
embodiment, the present invention provides for the use of a polynucleotide as set forth in 
25 any one of Tables 1-3 in the diagnosis or prognosis of ovarian cancer or for the 
preparation of a medicament for the treatment of ovarian cancer. 

The present invention also provides a nucleic acid vector comprising a polynucleotide of the 
invention. In one embodiment, the polynucleotide is operably linked to a regulatory 
30 control sequence capable of directing expression of the polynucleotide in a host cell. In a 
particularly preferred embodiment, the present invention provides for the use of a vector 
comprising a polynucleotide as set forth in any one of Tables 1-3 in the diagnosis or 
prognosis of ovarian cancer or for the preparation of a medicament for the treatment of 
ovarian cancer. 

35 
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The present invention further provides a host cell comprising a vector as described in the 
preceding paragraph. In a particularly preferred embodiment, the present invention 
provides for the use of a host cell comprising an introduced polynucleotide as set forth in 
any one of Tables 1-3 in the diagnosis or prognosis of ovarian cancer or for the 
5 preparation of a medicament for the treatment of ovarian cancer. 

In a further aspect, the present invention provides an isolated polypeptide which is encoded 
by a gene set forth in any one of Tables 1-3. The present invention also provides an 
isolated polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence 
10 at least 80% identical to a sequence as shown in Tables 1-3. In a particularly preferred 
embodiment, the present invention provides for the use of an isolated polypeptide as set 
forth in any one of Tables 1-3 in the diagnosis or prognosis of ovarian cancer or for the 
preparation of a medicament for the treatment of ovarian cancer. 

15 In a further aspect the present invention provides an antibody that binds specifically a 
polypeptide listed in Tables 1-3. In a particularly preferred embodiment, the present 
invention provides for the use of an antibody that binds to an isolated polypeptide as set 
forth in any one of Tables 1-3 in the diagnosis or prognosis of ovarian cancer or for the 
preparation of a medicament for the treatment of ovarian cancer. 

20 

Brief description of the Drawings 

Figure 1 is a photographic representation showing expression of genes as identified by 
immunohistochemical staining of fixed normal (i.e. non-cancerous or healthy) tissues 
25 (panel A) or ovarian cancer tissue (panel B). The inset in panel A shows inclusion cysts. 
The expression levels of the following genes listed in Table 1 or Table 3 were 
detenmined: Claudin-3 (SEQ ID NO: 15); EP-CAM (Accession No. NMJ)02354); and 
SOX17 (SEQ ID NO: 17). Positive controls CA125, MUC-1 and E-Cadherin were also 
included for comparison. 
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Figure 2 is a graphical representation showing the correlation between expression of 
different genes in serous ovarian cancer (SOC), mucinous ovarian cancer (MOC) f 
endometroid ovarian cancer (EnOC) and clear cell ovarian cancer (CICA). Genes 
5 indicated on the x-axis in each case are as in the legend to Figure 1. 

Figure 3 is a copy of a photographic representation showing immunohistochemical 
staining of ovary tissue from a normal healthy subject (normal ovary), a subject 
diagnosed with mucinous ovarian cancer (MOC) and a subject diagnosed with serous 
10 ovarian cancer (SOC), following staining with probes that are specific for L1-Cadherin 
(top row), meprin alpha (middle row) or galectin-4 (lower row). Magnification is indicated 
as 20-40X. 

Figure 4a is a copy of a photographic representation showing immunohistochemical 
15 staining of samples from a normal healthy subject (normal) or primary serous ovarian 
tumor (SOC), following staining with probes that are specific for sFRP4 (top row), or 
SOCS3 (lower row). Magnification is indicated as 20X. 

Figure 4b is a copy of a graphical representation showing a Kaplan-Meier survival curve 
20 correlating sFRP4 expression to patient survival over the medium term (i.e., from about 
12 months to about 48 months) to long term (more than about 48 months), indicating that 
high expression of sFRP4 is associated with poor survival in patients (n=127) having 
SOC (p=0.0056). 

25 Detailed description of the preferred embodiments 
Ovarian cancer-associated sequences 

Ovarian cancer-associated sequences can include both nucleic acid (i.e., "ovarian 
cancer-associated genes") and protein (i.e., "ovarian cancer-associated proteins"). 

30 As used herein, the term "ovarian cancer-associated protein" shall be taken to mean any 
protein that has an expression pattern correlated to an ovarian cancer, the recurrence of 
an ovarian cancer or the survival of a subject suffering from ovarian cancer. 
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Similarly, the term "ovarian cancer-associated gene" shall be taken to mean any nucleic 
acid encoding an ovarian cancer-associated protein or nucleic acid having an expression 
profile that is correlated to an ovarian cancer, the recurrence of an ovarian cancer or the 
survival of a subject suffering from ovarian cancer. 

As will be appreciated by those in the art and is more fully outlined below, ovarian 
cancer-associated genes are useful in a variety of applications, including diagnostic 
applications, which will detect naturally occurring nucleic acids, as well as screening 
applications; e.g., biochips comprising nucleic acid probes or PCR microtitre plates with 
10 selected probes to the ovarian cancer sequences are generated. 

For identifying ovarian cancer-associated sequences, the ovarian cancer screen typically 
includes comparing genes identified in different tissues, e.g., normal and cancerous 
tissues, or tumour tissue samples from patients who have metastatic disease vs. non 

15 metastatic tissue. Other suitable tissue comparisons include comparing ovarian cancer 
samples with metastatic cancer samples from other cancers, such as lung, breast, 
gastrointestinal cancers, ovarian, etc. Samples of different stages of ovarian cancer, e.g., 
survivor tissue, drug resistant states, and tissue undergoing metastasis, are applied to 
biochips comprising nucleic acid probes. The samples are first microdissected, if 

20 applicable, and treated as is known in the art for the preparation of mRNA. Suitable 
biochips are commercially available, e.g. from Affymetrix. Gene expression profiles as 
described herein are generated and the data analyzed. 

In one embodiment, the genes showing changes in expression as between normal and 
25 disease states are compared to genes expressed in other normal tissues, preferably 
normal ovarian, but also including, and not limited to lung, heart, brain, liver, breast, 
kidney, muscle, colon, small intestine, large intestine, spleen, bone and placenta. In a 
preferred embodiment, those genes identified during the ovarian cancer screen that are 
expressed in any significant amount In other tissues are removed from the profile, 
30 although in some embodiments, this is not necessary. That is, when screening for drugs, 
it is usually preferable that the target be disease specific, to minimise possible side 
effects. 

In a preferred embodiment, ovarian cancer-associated sequences are those that are up- 
35 regulated in ovarian cancer; that is, the expression of these genes is modifed (up- 
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regulated or down-regulated) in ovarian cancer tissue as compared to non-cancerous 
tissue (see Table 1). 

"Up-regulation" as used herein means at least about a two-fold change, preferably at 
5 least about a three fold change, with at least about five-fold or higher being preferred. All 
Unigene cluster identification numbers and accession numbers herein are for the 
GenBank sequence database and the sequences of the accession numbers are hereby 
expressly incorporated by reference. Sequences are also available in other databases, 
e.g., European Molecular Biology Laboratory (EMBL) and DNA Database of Japan 
10 (DDBJ). 

"Down-regulation" as used herein often means at least about a 1.5-fold change more 

preferably a two-fold change, preferably at least about a three fold change, with at least j 

about five-fold or higher being most preferred. j 

15 

Particularly preferred sequences are those referred to in Tables 1 or 3 that have a P 
value of less than 0.05, more preferably a P value of less than about 0.01 . 

it 

Similarly, preferred sequences are those referred to in Table 2 as having an absolute 
20 ratio of expression between ovarian patients and normal patients of at least about ±5.0, 
more preferably at least about ±6.0, even more preferably at leat about ±7.0 or at least 
about ±8.0 or at least about ±9.0 or at least about ±10.0. 

Detection of ovarian cancer sequences for diagnostic/prognostic applications 
25 In one aspect the RNA expression levels of genes are determined for different cellular 
states in the ovarian cancer phenotype. Expression levels of genes in normal tissue (i.e., 
not undergoing ovarian cancer) and in ovarian cancer tissue (and in some cases, for 
varying severities of ovarian cancer that relate to prognosis, as outlined below) are 
evaluated to provide expression profiles. An expression profile of a particular cell state or 
30 point of development is essentially a "fingerprint" of the state. While two states may have 
any particular gene similarly expressed, the evaluation of a number of genes 
simultaneously allows the generation of a gene expression profile that is reflective of the 
state of the cell. By comparing expression profiles of cells in different states, information < 
regarding which genes are important (including both up- and down-regulation of genes) 
35 in each of these states is obtained. Then, diagnosis are performed or confirmed to 

5 
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determine whether a tissue sample has the gene expression profile of normal or 
cancerous tissue. This will provide for molecular diagnosis of related conditions. 

"Differential expression," or grammatical equivalents as used herein, refers to qualitative 
or quantitative differences in the temporal and/or cellular gene expression patterns within 
and among cells and tissue. Thus, a differentially expressed gene can qualitatively have 
its expression altered, including an activation or inactivation, in, e.g., normal versus 
ovarian cancer tissue. Genes are turned on or turned off in a particular state, relative to 
another state thus permitting comparison of two or more states. A qualitatively regulated 
gene will exhibit an expression pattern within a state or cell type which is detectable by 
standard techniques. Some genes will be expressed in one state or cell type, but not in 
both. Alternatively, the difference in expression are quantitative, e.g., in that expression is 
increased or decreased; i.e., gene expression is either upregulated, resulting in an 
increased amount of transcript, or downregulated, resulting in a decreased amount of 
transcript. The degree to which expression differs need only be large enough to quantify 
via standard characterization techniques as outlined below, such as by use of Affymetrix 
GeneChipTM expression arrays, Lockhart. Nature Biotechnology 14:1675-1680 (1996), 
hereby expressly incorporated by reference. Other techniques include, but are not limited 
to, quantitative reverse transcriptase PCR. northern analysis and RNase protection. As 
outlined above, preferably the change in expression (i.e., upregulation or downregulation) 
is at least about 50%, more preferably at least about 100%, more preferably at least 
about 150%, more preferably at least about 200%, with from 300 to at least 1000% being 
especially preferred. 

Evaluation are at the gene transcript or the protein level. The amount of gene expression 
are monitored using nucleic acid probes to the DNA or RNA equivalent of the gene 
transcript, and the quantification of gene expression levels, or, alternatively, the final 
gene product itself (protein) are monitored, e.g., with antibodies to the ovarian cancer- 
associated protein and standard immunoassays (ELISAs, etc.) or other techniques, 
including mass spectroscopy assays, 2D gel electrophoresis assays, etc. Proteins 
corresponding to ovarian cancer genes, i.e., those identified as being important in a 
ovarian cancer phenotype, are evaluated in a ovarian cancer diagnostic test. 

In a preferred embodiment, gene expression monitoring is performed on a plurality of 
genes. Multiple protein expression monitoring are performed as well. Similarly, these 
assays are performed on an individual basis as well. 
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In this embodiment, the ovarian cancer nucleic acid probes are attached to biochips as 
outlined herein for the detection and quantification of ovarian cancer sequences in a 
particular cell. The assays are further described below in the example. PCR techniques 
5 are used to provide greater sensitivity. 

In a preferred embodiment nucleic acids encoding the ovarian cancer-associated protein 
are detected. Although DNA or RNA encoding the ovarian cancer-associated protein are 
detected, of particular interest are methods wherein an mRNA encoding a ovarian 

10 cancer-associated protein is detected. Probes to detect mRNA are a 
nucleotide/deoxynucleotide probe that is complementary to and hybridizes with the 
mRNA and includes, but is not limited to, oligonucleotides, cDNA or RNA. Probes also 
should contain a detectable label, as defined herein. In one method the mRNA is 
detected after immobilizing the nucleic acid to be examined on a solid support such as 

15 nylon membranes and hybridizing the probe with the sample. Following washing to 
remove the non-specifically bound probe, the label is detected. In another method 
detection of the mRNA is performed in situ. In this method permeabilized cells or tissue 
samples are contacted with a detectably labeled nucleic add probe for sufficient time to 
allow the probe to hybridize with the target mRNA. Following washing to remove the 

20 non-specifically bound probe, the label is detected. For example a digoxygenin labeled 
riboprobe (RNA probe) that is complementary to the mRNA encoding a ovarian cancer- 
associated protein is detected by binding the digoxygenin with an anti-digoxygenin 
secondary antibody and developed with nitro blue tetrazolium and 5-bromo-4-chloro-3- 
indoyl phosphate. 

25 

In a preferred embodiment, various proteins from the three classes of proteins as 
described herein (secreted, transmembrane or intracellular proteins) are used in 
diagnostic assays. The ovarian cancer-associated proteins, antibodies, nucleic acids, 
modified proteins and cells containing ovarian cancer sequences are used in diagnostic 
30 assays. This are performed on an individual gene or corresponding polypeptide level. In 
a preferred embodiment, the expression profiles are used, preferably in conjunction with 
high throughput screening techniques to allow monitoring for expression profile genes 
and/or corresponding polypeptides. 

35 As described and defined herein, ovarian cancer-associated proteins, including 
intracellular, transmembrane or secreted proteins, find use as markers of ovarian cancer. 
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Detection of these proteins in putative ovarian cancer tissue allows for detection or 
diagnosis of ovarian cancer. In one embodiment, antibodies are used to detect ovarian 
cancer-associated proteins. A preferred method separates proteins from a sample by 
electrophoresis on a gel (typically a denaturing and reducing protein gel, but are another 
5 type of gel, including isoelectric focusing gels and the like). Following separation of 
proteins, the ovarian cancer-associated protein is detected, e.g., by immunoblotting with 
antibodies raised against the ovarian cancer-associated protein. Methods of 
immunoblotting are well known to those of ordinary skill in the art. 

10 In another preferred method, antibodies to the ovarian cancer-associated protein find use 
in in situ imaging techniques, e.g., in histology (e.g., Methods in Cell Biology: Antibodies 
in Cell Biology, volume 37 (Asai, ed. 1993)). In this method cells are contacted with from 
one to many antibodies to the ovarian cancer-associated protein(s). Following washing to 
remove non-specific antibody binding, the presence of the antibody or antibodies is 

15 detected. In one embodiment the antibody is detected by incubating with a secondary 
antibody that contains a detectable label. In another method the primary antibody to the 
ovarian cancer-associated proteins) contains a detectable label, e.g. an enzyme marker 
that can act on a substrate. In another preferred embodiment each one of multiple 
primary antibodies contains a distinct and detectable label. This method finds particular 

20 use in simultaneous screening for a plurality of ovarian cancer-associated proteins. As 
will be appreciated by one of ordinary skill in the art, many other histological imaging 
techniques are also provided by the invention. 

In a preferred embodiment the label is detected in a fluorometer which has the ability to 
25 detect and distinguish emissions of different wavelengths. In addition, a fluorescence 
activated cell sorter (FACS) are used in the method. In another preferred embodiment, 
antibodies find use in diagnosing ovarian cancer from blood, serum, plasma, stool, and 
other samples. Such samples, therefore, are useful as samples to be probed or tested for. 
the presence of ovarian cancer-associated proteins. Antibodies are used to detect a 
30 ovarian cancer-associated protein by previously described immunoassay techniques 
including ELISA, immunoblotting (western blotting), immunoprecipitation, BIACORE 
technology and the like. Conversely, the presence of antibodies may indicate an immune 
response against an endogenous ovarian cancer-associated protein. 



35 In a preferred embodiment, in situ hybridization of labeled ovarian cancer nucleic acid 
probes to tissue arrays is done. For example, arrays of tissue samples, including ovarian 
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cancer tissue and/or normal tissue, are made. In situ hybridization (see, e.g., Ausubel, 
supra) is then performed. When comparing the fingerprints between an individual and a 
standard, the skilled artisan can make a diagnosis, a prognosis, or a prediction based on 
the findings. It is further understood that the genes which indicate the diagnosis may 
differ from those which indicate the prognosis and molecular profiling of the condition of 
the cells may lead to distinctions between responsive or refractory conditions or are 
predictive of outcomes. 

In a preferred embodiment, the ovarian cancer-associated proteins, antibodies, nucleic 
acids, modified proteins and cells containing ovarian cancer sequences are used in 
prognosis assays. As above, gene expression profiles are generated that correlate to 
ovarian cancer, in terms of long term prognosis. Again, this are done on either a protein 
or gene level, with the use of genes being preferred. As above, ovarian cancer probes 
are attached to biochips for the detection and quantification of ovarian cancer sequences 
in a tissue or patient. The assays proceed as outlined above for diagnosis. PCR method 
may provide more sensitive and accurate quantification. 

Characteristics of ovarian cancer-associated proteins and genes encoding same 
Ovarian cancer-associated proteins of the present invention are classified as secreted 
proteins, transmembrane proteins or intracellular proteins. In one embodiment, the 
ovarian cancer-associated protein is an intracellular protein. Intracellular proteins are 
found in the cytoplasm and/or in the nucleus. Intracellular proteins are involved in all 
aspects of cellular function and replication (including, e.g., signaling pathways); aberrant 
expression of such proteins often results in unregulated or disregulated cellular 
processes (see, e.g., Molecular Biology of the Cell (Alberts, ed., 3rd ed., 1994). For 
example, many intracellular proteins have enzymatic activity such as protein kinase 
activity, protein phosphatase activity, protease activity, nucleotide cyclase activity, 
polymerase activity and the like. Intracellular proteins also serve as docking proteins that 
are involved in organizing complexes of proteins, or targeting proteins to various 
subcellular localizations, and are involved in maintaining the structural integrity of 
organelles. 

An increasingly appreciated concept in characterising proteins is the presence in the 
proteins of one or more motifs for which defined functions have been attributed. In 
addition to the highly conserved sequences found in the enzymatic domain of proteins, 
highly conserved sequences have been identified in proteins that are involved in 
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protein-protein interaction. For example, Src-homology-2 (SH2) domains bind 
tyrosine-phosphorylated targets in a sequence dependent manner. PTB domains, which 
are distinct from SH2 domains, also bind tyrosine phosphorytated targets. SH3 domains 
bind to proline-rich targets. In addition, PH domains, tetratricopeptide repeats and WD 
domains to name only a few, have been shown to mediate proteiivprotein interactions. 
Some of these may also be involved in binding to phospholipids or other second 
messengers. As will be appreciated by one of ordinary skill in the art, these motifs are 
identified on the basis of primary sequence; thus, an analysis of the sequence of proteins 
may provide insight into both the enzymatic potential of the molecule and/or molecules 
with which the protein may associate. One useful database is Pfam (protein families), 
which is a large collection of multiple sequence alignments and hidden Markov models 
covering many common protein domains. Versions are available via the internet from 
Washington University in St. Louis, the Sanger Center in England, and the Karolmska 
Institute in Sweden (see, e.g., Bateman et al., 2000, Nuc. Adds Res. 28: 263-266; 
Sonnhammer et al., 1997, Proteins 28: 405-420; Bateman et al., 1999, Nuc. Acids Res. 
27:260-262; and Sonnhammer et al., 1998, Nuc. Acids Res. 26: 320-322. 

In another embodiment, the ovarian cancer sequences are transmembrane proteins. 
Transmembrane proteins are molecules that span a phospholipid bilayer of a cell. They 
may have an intracellular domain, an extracellular domain, or both. The intracellular 
domains of such proteins may have a number of functions including those already 
described for intracellular proteins. For example, the intracellular domain may have 
enzymatic activity and/or may serve as a binding site for additional proteins. Frequently 
the intracellular domain of transmembrane proteins serves both roles. For example 
certain receptor tyrosine kinases have both protein kinase activity and SH2 domains. In 
addition, autophosphorylation of tyrosines on the receptor molecule itself, creates binding 
sites for additional SH2 domain containing proteins. 

Transmembrane proteins may contain from one to many transmembrane domains. For 
example, receptor tyrosine kinases, certain cytokine receptors, receptor guanylyl 
cyclases and receptor serine/threonine protein kinases contain a single transmembrane 
domain. However, various other proteins including channels and adenylyl cyclases 
contain numerous transmembrane domains. Many important cell surface receptors such 
as G protein coupled receptors (GPCRs) are classified as "seven transmembrane 
domain" proteins, as they contain 7 membrane spanning regions. Characteristics of 
transmembrane domains include approximately 20 consecutive hydrophobic amino acids 
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that are followed by charged amino acids. Therefore, upon analysis of the amino acid 
sequence of a particular protein, the localization and number of transmembrane domains 
within the protein are predicted (see, e.g. PSORT web site http://psortnibb.ac.jp/). 
Important transmembrane protein receptors include, but are not limited to the insulin 
5 receptor, insulin-like growth factor receptor, human growth hormone receptor, glucose 
transporters, transferrin receptor, epidermal growth factor receptor, low density 
lipoprotein receptor, epidermal growth factor receptor, leptin receptor, interleukin 
receptors, e.g. IL-1 receptor, IL-2 receptor, 

10 The extracellular domains of transmembrane proteins are diverse; however, conserved 
motifs are found repeatedly among various extracellular domains. Conserved structure 
and/or functions have been ascribed to different extracellular motifs. Many extracellular 
domains are involved in binding to other molecules. In one aspect, extracellular domains 
are found on receptors. Factors that bind the receptor domain include circulating ligands, 

15 which are peptides, proteins, or small molecules such as adenosine and the like. For 
example, growth factors such as EGF, FGF and PDGF are circulating growth factors that 
bind to their cognate receptors to initiate a variety of cellular responses. Other factors 
include cytokines, mitogenic factors, neurotrophic factors and the like. Extracellular 
domains also bind to cell-associated molecules. In this respect, they mediate cell-cell 

20 interactions., Cell-associated ligands are tethered to the cell, e.g., via a 
glycosylphosphatidylinositol (GPI) anchor, or may themselves be transmembrane 
proteins. Extracellular domains also associate with the extracellular matrix and contribute 
to the maintenance of the cell structure. 

25 Ovarian cancer-associated proteins that are transmembrane are particularly preferred in 
the present invention as they are readily accessible targets for immunotherapeutics, as 
are described herein. In addition, as outlined below, transmembrane proteins are also 
useful in imaging modalities. Antibodies are used to label such readily accessible 
proteins in situ. Alternatively, antibodies can also label intracellular proteins, in which 

30 case samples are typically permeablized to provide access to intracellular proteins. 

It will also be appreciated by those in the art that a transmembrane protein are made 
soluble by removing transmembrane sequences, e.g., through recombinant methods. 
Furthermore, transmembrane proteins that have been made soluble are made to be 
35 secreted through recombinant means by adding an appropriate signal sequence. 
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In another embodiment, the ovarian cancer-associated proteins are secreted proteins; 
the secretion of which are either constitutive or regulated. These proteins have a signal 
peptide or signal sequence that targets the molecule to the secretory pathway. Secreted 
proteins are involved in numerous . physiological events; by virtue of their circulating 
5 nature, they serve to transmit signals to various other cell types. The secreted protein 
may function in an autocrine manner (acting on the cell that secreted the factor), a 
paracrine manner (acting on cells in close proximity to the cell that secreted the factor) or 
an endocrine manner (acting on cells at a distance). Thus secreted molecules find use in 
modulating or altering numerous aspects of physiology. Ovarian cancer-associated 
10 proteins that are secreted proteins are particularly preferred in the present invention as 
they serve as good targets for diagnostic markers, e.g., for blood, plasma, serum, or 
stool tests. 

Mammalian subjects 

15 The present invention provides nucleic acid and protein sequences that are differentially 
expressed in ovarian cancer, herein termed "ovarian cancer sequences." As outlined 
below, ovarian cancer sequences include those that are up-regulated (i.e., expressed at 
a higher level) in ovarian cancer, as well as those that are down-regulated (i.e., 
expressed at a lower level). In a preferred embodiment, the ovarian cancer sequences 

20 are from humans; however, as will be appreciated by those in the art, ovarian cancer 
sequences from other organisms are useful in animal models of disease and drug 
evaluation; thus, other ovarian cancer sequences are provided, from vertebrates, 
including mammals, including rodents (rats, mice, hamsters, guinea pigs, etc.), primates, 
farm animals (including sheep, goats, pigs, cows, horses, etc.) and pets, e.g., (dogs, 

25 cats, etc.). 

Assay control samples 

It will be apparent from the preceding discussion that many of the diagnostic methods 
provided by the present invention involve a degree of quantification to determine, on the 

30 one hand, the over-expression or reduced-expression of a diagnostic/prognostic marker 
in tissue that is suspected of comprising a cancer cell. Such quantification can be readily 
provided by the inclusion of appropriate control samples in the assays described below, 
derived from healthy or normal individuals. Alternatively, if internal controls are not 
included in each assay conducted, the control may be derived from an established data 

35 set that has been generated from healthy or normal individuals. 
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In the present context, the term "healthy individuar shall be taken to mean an individual 
who is known not to suffer from ovarian cancer, such knowledge being derived from 
clinical data on the individual, including, but not limited to, a different cancer assay to that 
described herein. As the present invention is particularly useful for the early detection of 
5 ovarian cancer, it is preferred that the healthy individual is asymptomatic with respect to 
the early symptoms associated with ovarian cancer. Although early detection using well- 
known procedures is difficult, reduced urinary frequency, rectal pressure, and abdominal 
bloating and swelling, are associated with the disease in its early stages, and, as a 
consequence, healthy individuals should not have any of these clinical symptoms. 
10 Clearly, subjects suffering from later symptoms associated with ovarian cancer, such as, 
for example, metastases in the omentum, abdominal fluid, lymph nodes, lung, liver, brain, 
or bone, and subjects suffering from spinal cord compression, elevated calcium level, 
chronic pain, or pleural effusion, should also be avoided from the "healthy individual" data 
set. 

15 

The term °norma! individual" shall be taken to mean an individual having a normal level of 
expression of a cancer-associate gene or cancer-associated protein in a particular 
sample derived from said individual. As will be known to those skilled in the art, data 
obtained from a sufficiently large sample of the population will normalize, allowing the 

20 generation of a data set for determining the average level. of a particular parameter. 
Accordingly, the level of expression of a cancer-associate gene or cancer-associated 
protein can be determined for any population of individuals, and for any sample derived 
from said individual, for subsequent comparison to levels determined for a sample being 
assayed. Where such normalized data sets are relied upon, internal controls are 

25 preferably included in each assay conducted to control for variation. 

In one embodiment the present invention provides a method for detecting a cancer cell 
in a subject, said method comprising: 

(i) determining the level of mRNA encoding a cancer-associated protein expressed 
30 in a test sample from said subject; and 

(ii) comparing the level of mRNA determined at (i) to the level of mRNA encoding a 
cancer-associated protein expressed in a comparable sample from a healthy or 
normal individual, 

wherein a level of mRNA at (i) that is modified in the test sample relative to the 
35 comparable sample from the normal or healthy individual is indicative of the presence of 
a cancer cell in said subject. 
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Alternatively, or in addition, the controll may comprise a cancer-associated sequence that 
is known to be expressed at a particular level in an ovarian cancer, eg., CA125, MUC-1 
or E-Cadherin, amongast others. 

5 

Biological samples 

Preferred biological samples in which the assays of the invention are performed include 
bodily fluids, ovarian tissue and cells, and those tissues known to comprise cancer cells 
arising from a metastasis of an ovarian cancer, such as, for example, in carcinomas of 
10 the lung, prostate, breast, colon, pancreas, placenta, or omentum , and in cells of brain 
anaplastic oligodendrogliomas. 

Bodily fluids shall be taken to include whdle blood, serum, peripheral blood mononuclear 
cells (PBMC), or buffy coat fraction. 

15 

In the present context, the term "cancer cell" includes any biological specimen or sample 
comprising a cancer cell irrespective of its degree of isolation or purity, such as, for 
example, tissues, organs, ceil lines, bodily fluids, or histology specimens that comprise a 
cell in the early stages of transformation or having been transformed. 

20 

As the present invention is particularly useful for the early detection and prognosis of 
cancer ofe rthe medium to long term, the definition of "cancer cell" is not to be limited by 
the stage of a cancer in the subject from which said cancer cell is derived (ie. whether or 
not the patient is in remission or undergoing disease recurrence or whether or not the 
25 cancer is a primary tumor or the consequence of metastases). Nor is the term "cancer 
cell" to be limited by the stage of the cell cycle of said cancer cell. 

Preferably, the sample comprises ovarian tissue, prostate tissue, kidney tissue, uterine 
tissue, placenta, a cervical specimen, omentum, rectal tissue, brain tissue, bone tissue, 

30 lung tissue, lymphatic tissue, urine, semen, blood, abdominal fluid, or serum, or a cell 
preparation or nucleic acid preparation derived therefrom. More preferably, the sample 
comprises serum or abdominal fluid, or a tissue selected from the group consisting of: 
ovary, lymph, lung, liver, brain, placenta, brain, omentum, and prostate. Even more 
preferably, the sample comprises serum or abdominal fluid, ovary (eg. OSE), or lymph 

35 node tissue. The sample can be prepared on a solid matrix for histological analyses, or 
alternatively, in a suitable solution such as, for example, an extraction buffer or 
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suspension buffer, and the present invention clearly extends to the testing of biological 
solutions thus prepared. 



Polynucleotide probes and amplification primers 
5 Polynucleotide probes are derived from or comprise the nucleic acid sequences whose 
nucleotide sequences are provided by reference to the public database accession 
numbers given in Tables 1-3 (referred to herein as the nucleotide sequences shown in 
Tables 1-3), and sequences homologues thereto as well as variants, derivatives and 
fragments thereof. 

10 

Whilst the probes may comprise double-stranded or single-stranded nucleic acid, single- 
stranded probes are preferred because they do not require melting prior to use in 
hybridizations. On the other hand, longer probes are also preferred because they can be 
used at higher hybridization stringency than shorter probes and may produce lower 
15 background hybridization than shorter probes. 

So far as shorter probes are concerned, single-stranded, chemically-synthesized 
oligonucleotide probes are particularly preferred by the present invention. To reduce the 
noise associated with the use of such probes during hybridization, the nucleotide 

20 sequence of the probe is carefully selected to maximize the Tm at which hybridizations 
can be performed, reduce non-specific hybridization, and to reduce self-hybridization. 
Such considerations may be particularly important for applications involving high 
throughput screening using microarray technology. In general, this means that the 
nucleotide sequence of an oligonucleotide probe is selected such that it is unique to the 

25 target RNA or protein-encoding sequence, has a low propensity to form secondary 
structure, low self-complementary, and is not highly A/T-rich. 

The only requirement for the probes is that they cross-hybridize to nucleic acid encoding 
the target diagnostic protein or the complementary nucleotide sequence thereto and are 

30 sufficiently unique in sequence to generate high signalrnoise ratios under specified 
hybridization conditions. As will be known to those skilled in the art, long nucleic acid 
probes are preferred because they tend to generate higher signal:noise ratios than 
shorter probes and/or the duplexes formed between longer molecules have higher 
melting temperatures (i.e. Tm values) than duplexes involving short probes. Accordingly, 

35 full-length DNA or RNA probes are contemplated by the present invention, as are specific 
probes comprising the sequence of the 3'Hintranslated region or complementary thereto. 
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In a particularly preferred embodiment, the nucleotide sequence of an oligonucleotide 
probe has no detectable nucleotide sequence identity to a nucleotide sequence in a 
BLAST search (Altschul et a/., J. MoL Biol. 215, 403-410, 1990) or other database 
5 search, other than a sequence selected from the group consisting of: (a) a sequence 
encoding a polypeptide listed in any one of Tables 1-3; (b) the 5 -untranslated region of a 
sequence encoding a polypeptide listed in any one of Tables 1-3; (c) a 3'-untranslated 
region of a sequence encoding a polypeptide listed in any one of Tables 1-3; and (d) an 
exon region of a sequence encoding a polypeptide listed in any one of Tables 1-3. 

10 

Additionally, the self-complementarity of a nucleotide sequence can be determined by 
aligning the sequence with its reverse complement, wherein detectable regions of identity 
are indicative of potential self-complementarity. As will be known to those skilled in the 
art, such sequences may not necessarily form secondary structures during hybridization 

15 reaction, and, as a consequence, successfully identify a target nucleotide sequence. It is 
also known to those skilled in the art that, even where a sequence does form secondary 
structures during hybridization reactions, reaction conditions can be modified to reduce 
the adverse consequences of such structure formation. Accordingly, a potential for self- 
complementarity should not necessarily exclude a particular candidate oligonucleotide 

20 from selection. In cases where it is difficult to determine nucleotide sequences having no 
potential self-complementarity, the uniqueness of the sequence should outweigh a 
consideration of its potential for secondary structure formation. 

Recommended pre-requisites for selecting oligonucleotide probes, particularly with 
25 respect to probes suitable for microarray technology, are described in detail by Lockhart 
et a/., ^Expression monitoring by hybridization to high-density oligonucleotide arrays", 
Nature Biotech.14, 1675-1680, 1996. 

The nucleic acid probe may comprise a nucleotide sequence that is within the coding 
30 strand of a gene listed in any one of Tables 1-3. Such "sense" probes are useful for 
detecting RNA by amplification procedures, such as, for example, polymerase chain 
reaction (PCR), and more preferably, quantitative PCR or reverse transcription 
polymerase chain reaction (RT-PCR). Alternatively, "sense" probes may be expressed to 
produce polypeptides or immunologically active derivatives thereof that are useful for 
35 detecting the expressed protein in samples. 
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The nucleotide sequences referred to in Tables 1-3 and homologues thereof, typically 
encode polypeptides. It will be understood by a skilled person that numerous different 
polynucleotides can encode the same polypeptide as a result of the degeneracy of the 
5 genetic code. In addition, it is to be understood that skilled persons may, using routine 
techniques, make nucleotide substitutions that do not affect the polypeptide sequence 
encoded by the polynucleotides of the invention to reflect the codon usage of any particular 
host organism in which the polypeptides of the invention are to be expressed. 

10 Polynucleotides may comprise DNA or RNA. They are single-stranded or double- 
stranded. They may also be polynucleotides which include within them synthetic or 
modified nucleotides. A number of different types of modification to oligonucleotides are 
known in the art. These include methylphosphonate and phosphorothioate backbones, 
addition of acridine or polylysine chains at the 3* and/or 5' ends of the molecule. For the 

15 purposes of the present invention, it is to be understood that the polynucleotides 
described herein are modified by any method available in the art. Such modifications are 
carried out in order to enhance the in vivo activity or life span of the diagnostic/prognostic 
polynucleotides. . 

20 The terms "variant" or "derivative" in relation to the nucleotide sequences of the present 
invention include any substitution of, variation of, modification of, replacement of, deletion of 
or addition of one (or more) nucleic acid from or to the sequence provided that the resultant 
nucleotide sequence codes for a polypeptide having biological activity, preferably having 
substantially the same activity as the polypeptide sequences presented in the sequence 

25 listings. 

With respect to sequence homology, preferably there is at least 75%, more preferably at 
least 85%, more preferably at least 90% homology to a sequence shown in Tables 1-3 
herein over a region of at least 20, preferably at least 25 or 30,.for instance at least 40, 60, 
30 100, 500, 1000 or more contiguous nucleotides. More preferably there is at least 95%, 
more preferably at least 98%, homology. In one embodiment homologues are naturally 
occurring sequences, such as orthologues, tissue-specific isoforms and allelic variants. 

Homology comparisons are conducted by eye, or more usually, with the aid of readily 
35 available sequence comparison programs. These commercially available computer 
programs can calculate % homology between two or more sequences. 
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Percentage (%) homology are calculated over contiguous sequences, i.e. one sequence is 
aligned with the other sequence and each nucleotide in one sequence directly compared 
with the corresponding nucleotide in the other sequence, one base at a time. This is called 
5 an "ungapped* alignment Typically, such ungapped alignments are performed only over a 
relatively short number of bases (for example less than 50 contiguous nucleotides). 

Although this is a very simple and consistent method, it fails to take into consideration that, 
for example, in an otherwise identical pair of sequences, one insertion or deletion will cause 

10 the following nucleotides to be put out of alignment thus potentially resulting in a large 
reduction in % homology when, a global alignment is performed. Consequently, most 
sequence comparison methods are designed to produce optimal alignments that take into 
consideration possible insertions and deletions without penalising unduly the overall 
homology score. This is achieved by inserting "gaps" in the sequence alignment to try to 

15 maximise local homology. 

However, these more complex methods assign "gap penalties" to each gap that occurs in 
the alignment so that, for the same number of identical amino acids, a sequence alignment 
with as few gaps as possible - reflecting higher relatedness between the two compared 

20 sequences - will achieve a higher score than one with many gaps. "Affine gap costs" are 
typically used that charge a relatively high cost for the existence of a gap and a smaller 
penalty for each subsequent residue in the gap. This is the most commonly used gap 
scoring system. High gap penalties will of course produce optimised alignments with fewer 
gaps. Most alignment programs allow the gap penalties to be modified. However, it is 

25 preferred to use the default values when using such software for sequence comparisons. 

In determining whether or not two amino acid sequences fall within the stated defined 
percentage identity limits, those skilled in the art will be aware that it is necessary to 
conduct a side-by-side comparison of amino acid sequences. In such comparisons or 

30 alignments, differences will arise in the positioning of non-identical amino acid residues 
depending upon the algorithm used to perform the alignment In the present context, 
references to percentage identities and similarities between two or more amino acid 
sequences shall be taken to refer to the number of identical and similar residues 
respectively, between said sequences as determined using any standard algorithm 

35 known to those skilled in the art. In particular, amino acid identities and similarities are 
calculated using the GAP program of the Computer Genetics Group, Inc., University 
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Research Park, Madison, Wisconsin, United States of America (Devereaux et a/, Nucl. 
Acids Res. 12, 387-395,1984), which utilizes the algorithm of Needleman and Wunsch J. 
Mol. Biol. 48, 443-453, 1970, or alternatively, the CLUSTAL W algorithm of Thompson et 
a/., Nucl. Acids Res. 22, 4673-4680, 1994, for multiple alignments, to maximize the. 
5 number of identical/similar amino acids and to minimize the number and/or length of 
sequence gaps in the alignment. 

A suitable computer program for canying out such an alignment is the GCG Wisconsin 
Bestfit package (University of Wisconsin, U.SA; Devereux et al., 1984, Nucleic Acids 
10 Research 12:387). The default scoring matrix has a match value of 10 for each identical 
nucleotide and -9 for each mismatch. The default gap creation penalty is -50 and the default 
gap extension penalty is -3 for each nucleotide. 

Examples of other software than can perform sequence comparisons include, but are not 
15 limited to, the BLAST package (see Ausubel et al, 1999 ibid - Chapter 18), FASTA 
(Atschul et -at, 1990, J. Mol. Biol., 403-410) and the GENEWORKS suite of comparison 
tools. Both BLAST and FASTA are available for offline and online searching (see 
Ausubel et a/., 1999 ibid, pages 7-58 to 7-60). However it is preferred to use the GCG 
Bestfit program. 

20 

Once the software has produced an optimal alignment, it is possible to calculate % 
homology, preferably % sequence identity. The software typically does this as part of the 
sequence comparison and generates a numerical result. 

25 A preferred sequence comparison program is the GCG Wisconsin Bestfit program 
described above. 

The present invention also encompasses the use of nucleotide sequences that are capable 
of hybridizing selectively to the sequences presented herein, or any variant, fragment or 
30 derivative thereof, or to the complement of any of the above. Nucleotide sequences are 
preferably at least 15 nucleotides in length, more preferably at least 20, 30, 40 or 50 
nucleotides in length. 

The term "hybridization* as used herein shall include "the process by which a strand of 
35 nucleic acid joins with a complementary strand through base pairing" as well as the 
process of amplification as carried out in polymerase chain reaction technologies. 
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Polynucleotides capable of selectively hybridizing to the nucleotide sequences presented 
herein, or to their complement, will be generally at least 70%, preferably at least 80 or 90% 
and more preferably at least 95% or 98% homologous to the corresponding nucleotide 
sequences referred to in Tables 1-3 over a region of at least 20, preferably at least 25 or 30, 
for instance at least 40, 60, 100, 500, 1000 or more contiguous nucleotides. 

The term "selectively hybridizable" means that the polynucleotide used as a probe is used 
under conditions where a target polynucleotide is found to hybridize to the probe at a level 
significantly above background. The background hybridization may occur because of other 
polynucleotides present, for example, in the cDNA or genomic DNA library being screening. 
In this event, background implies a level of signal generated by interaction between the 
probe and a non-specific DNA member of the library which is less than 10 fold, preferably 
less than 100 fold as intense as the specific interaction observed with the target DNA. The 
intensity of interaction are measured, for example, by radiolabelling the probe, e.g. with "P. 

Hybridization conditions are based on the melting temperature (Tm) of the nucleic acid 
binding complex, as taught in Berger and Kimmel (1987, Guide to Molecular Cloning 
Techniques, Methods in Enzymology, Vol 152, Academic Press, San Diego CA), and 
confer a defined "stringency" as explained below. 

For the purposes of defining the level of stringency, a high stringency hybridization is 
achieved using a hybridization buffer and/or a wash solution comprising the following: 

(i) a salt concentration that is equivalent to 0.1xSSG-0.2xSSC buffer or lower salt 
concentration; 

(ii) a detergent concentration equivalent to 0. 1 % (w/v) SDS or higher; and 

(iii) an incubation temperature of 55°C or higher. 

Conditions for specifically hybridizing nucleic acid, and conditions for washing to remove 
non-specific hybridizing nucleic acid, are well understood by those skilled in the art. For 
the purposes of further clarification only, reference to the parameters affecting 
hybridization between nucleic acid molecules is found in Ausubel et al. (Current 
Protocols in Molecular Biology, Wiley Interscience, ISBN 047150338, 1992), which is 
herein incorporated by reference. 
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Maximum stringency typically occurs at about Tm-5°C (5°C below the Tm of the probe); 
high stringency at about 5°C to 10°G below Tm; intermediate stringency at about 10°C to 
20°C below Tm; and low stringency at about 20°C to 25°C below Tm. As will be 
understood by those of skill in the ait a maximum stringency hybridization are used to 
5 identify or detect identical polynucleotide sequences while an intermediate (or low) 
stringency hybridization are used to identify or detect similar or related polynucleotide 
sequences. 

In a preferred aspect, the present invention covers nucleotide sequences that can hybridize 
10 to the nucleotide sequence of the present invention under stringent conditions (e.g. 65°C 
and 0:1xSSC {1xSSC = 0.15 M NaCI, 0.015 M Na 3 Citrate pH 7.0}). 

Where the diagnostic/prognostic polynucleotide is double-stranded, both strands of the 
duplex, either individually or in combination, are encompassed by the present invention. 
15 Where the polynucleotide is single-stranded, it is to be understood that the complementary 
sequence of that polynucleotide is also included within the scope of the present invention. 

Polynucleotides which are not 100% homologous to the sequences of the present invention 
but are useful in perfoming the diagnostic and/or prognostic assays of the invetnion by 

20 virtue of their ability to selectively hybridize to the target gene transcript, or to encode an 
immunologically cross-reactive protein to the target protein, are obtained in a number of 
ways, such as, for example by probing DNA libraries made from a range of individuals, for 
example individuals from different populations. In particular, given that that changes in the 
expression of diagnostic/prognostic cancer-associated genes correlate with ovarian cancer, 

25 characterisation of variant sequences in individuals suffering from ovarian cancer is used to 
identify variations in the sequences of ovarian-cancer associated genes (and proteins) that 
are predictive of and/or causative of ovarian cancer. 

Accordingly the present invention provides methods of identifying sequence variants that 
30 are associated with ovarian cancer which methods comprise determining all or part of the 
nucleotide sequence of a gene referred to in Tables 1-3, derived from an individual suffering 
from ovarian cancer and comparing the sequence to that of the corresponding wild-type 
gene. 



35 In addition, other viral/bacterial, or cellular homologues particularly cellular homologues 
found In mammalian cells (e.g. rat, mouse, bovine and primate cells), are obtained and such 
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homologues and fragments thereof in general will be capable of selectively hybridizing to 
the sequences shown in the sequence listing herein. Such sequences are obtained by 
probing cDNA libraries made from or genomic DNA libraries from other animal species, and 
probing such libraries with probes comprising all or part of the sequences referred to in 
Tables 1-3 under conditions of medium to high stringency. Similar considerations apply to 
obtaining species homologues and allelic variants of the nucleotide sequences referred to in 
Tables 1-3. 

Variants and strain/species homologues may also be obtained using degenerate PCR 
which will use primers designed to target sequences within the variants and homologues 
encoding conserved amino acid sequences within the sequences of the present invention. 
Conserved sequences are predicted, for example, by aligning the amino acid sequences 
from several variants/homologues. Sequence alignments are performed using computer 
software known in the art. For example the GCG Wisconsin PileUp program is widely used. 

The primers used in degenerate PCR will contain one or more degenerate positions and will 
be used at stringency conditions lower than those used for cloning sequences with single 
sequence primers against known sequences. 

Alternatively, such polynucleotides are obtained by site-directed mutagenesis of 
characterised sequences, such as the sequences referred to in Tables 1-3. This are useful 
where for example silent codon changes are required to sequences to optimise codon 
preferences for a particular host cell in which the polynucleotide sequences are being 
expressed. Other sequence changes are desired in order to introduce restriction enzyme 
recognition sites, or to alter the property or function of the polypeptides encoded by the 
polynucleotides.. 

Polynucleotides comprising a diagnostic/prognostic cancer-associated gene are used to 
produce a primer by standard derivatization means, e.g. a PCR primer, a primer for an 
alternative amplification reaction, a probe e.g. labelled with a detectable label by 
conventional means using radioactive or non-radioactive labels, or the polynucleotides are 
cloned into vectors. Such primers, probes and other fragments will be at least 15, preferably 
at least 20, for example at least 25, 30 or 40 nucleotides in length. Preferred fragments are 
less than 5000, 2000, 1000, 500 or 200 nucleotides in length. 



o o 

WO 2004/022778 PCT/AU2003/001 166 

54 

Polynucleotides such as a DNA polynucleotides and probes according to the invention are 
produced by recombinant or synthetic means, including cloning by standard techniques. 

In general, primers will be produced by synthetic means, involving a step wise manufacture 
5 of the desired nucleic acid sequence one nucleotide at a time. Techniques for 
accomplishing this using automated techniques are readily available in the art 

Longer polynucleotides will generally be produced using recombinant means, for example 
using PCR (polymerase chain reaction) cloning techniques. This will involve making a pair 

10 of primers (e.g. of about 15 to 30 nucleotides) flanking a region of the sequence which it is . 
desired to clone, bringing the primers into contact with mRNA or cDNA obtained from an 
enimal or human cell, performing a polymerase chain reaction under conditions which bring 
about amplification of the desired region, isolating the amplified fragment (e.g. by purifying 
the reaction mixture on an agarose gel) and recovering the amplified DNA. The primers are 

15 designed to contain suitable restriction enzyme recognition sites so that the amplified DNA 
are cloned into a suitable cloning vector 

Polynucleotide probes or primers preferably carry a detectable label. Suitable labels 
include radioisotopes such as ^P or 35 S, enzyme labels, or other protein labels such as 
20 blotin. Such labels are added to polynucleotides or primers and are detected using by 
techniques known in the art. 

Polynucleotide probes or primers labeled or unlabeled are also used by a person skilled 
in the art in nucleic acid-based tests for detecting or sequencing diagnostic/prognostic 
25 cancer-associated gene. 

Such tests for detecting generally comprise bringing a biological sample containing DNA 
or RMA into contact with a probe comprising a polynucleotide probe or primer under at 
least low stringency hybridization conditions and detecting any duplex formed between 

30 the probe/primer and nucleic acid in the sample. Such detection are achieved using 
techniques such as PCR or by immobilising the probe on a solid support, removing 
nucleic acid in the sample which is not hybridized to the probe, and then detecting 
nucleic acid which has hybridized to the probe. Alternatively, the sample nucleic acid are 
immobilised on a solid support, and the amount of probe bound to such a support are 

35 detected. Suitable assay methods of this and other formats are found in for example 
W089/03891 and W090/1 3667. 
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Tests for sequencing nucleotides include bringing a biological sample containing target 
DNA or RNA into contact with a probe comprising a polynucleotide probe or primer under 
at least low stringency hybridization conditions and determining the sequence by, for 
example the Sanger dideoxy chain termination method (see Sambrook et al.). 

Such a method generally comprises elongating, in the presence of suitable reagents, the 
primer by synthesis of a strand complementary to the target DNA or RNA and selectively 
terminating the elongation reaction at one or more of an A, C, G or T/U residue; allowing 
strand elongation and termination reaction to occur; separating out according to size the 
elongated products to determine the sequence of the nucleotides at which selective 
termination has occurred. Suitable reagents include a DNA polymerase enzyme, the 
deoxynucleotides dATP, dCTP, dGTP and dTTP, a buffer and ATP. Dideoxynucleotides 
are used for selective termination. 

Tests for detecting or sequencing nucleotides in a biological sample are used as part of 
the methods of the invention for detecting ovarian cancer-associated transcripts and 
monitoring the efficacy of treatment of patients suffering from ovarian cancer as 
described in more detail herein. 

The probes/primers may conveniently be packaged in the form of a test kit in a suitable 
container. In such kits the probe are bound to a solid support where the assay format for 
which the kit is designed requires such binding. The kit may also contain suitable 
reagents for treating the sample to be probed, hybridizing the probe to nucleic acid in the 
sample, control reagents, instructions, and the like. 

Preferably, a kit of the invention comprises primers/probes suitable for selectively 
detecting a plurality of sequences, more preferably for selectively detecting a plurality of 
sequences that are listed in one or more of Tables 1-3 as having a P value of less than 
0.05, more preferably a P value of less than 0.01. Similarly, a kit of the invention 
preferably comprises primers suitable for selectively detecting a plurality of sequences 
referred to in Table 1 or 2 or 3. 

Nucleic acid-based assay formats 

As discussed in detail below, the status of expression of a cancer-associated gene in 
patient samples may be analyzed by a variety protocols that are well known in the art 
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including in situ hybridization, northern blotting techniques, RT-PCR analysis (such as, 
for example, performed on laser capture microdissected samples), and microarray 
technology, such as, for example, using tissue microarrays probed with nucleic acid 
probes, or nucleic acid microarrays (ie. RNA microarrays or amplified DNA microarrays) 
microarrays probed with nucieic acid probes. All such assay formats are encompassed 
by the present invention. 

For high throughput screening of large numbers of samples, such as, for example, public 
health screening of subjects, particularly human subjects, having a higher risk of 
developing cancer, microarray technology is a preferred assay format. 

In accordance with such high throughput formats, techniques for producing immobilised 
arrays of DNA molecules have been described in the art. Generally, most prior art 
methods describe how to synthesise single-stranded nucleic acid molecule arrays, using 
for example masking techniques to build up various permutations of sequences at the 
various discrete positions on the solid substrate. U.S. Patent No. 5,837,832, the contents 
of which are incorporated herein by reference, describes an improved method for 
producing DNA arrays immobilised to silicon substrates based on very large scale 
integration technology. In particular, U.S. Patent No. 5,837,832 describes a strategy called 
"tiling" to synthesize specific sets of probes at spatially-defined locations on a substrate 
which are used to produced the immobilised DNA arrays. U.S. Patent No. 5,837,832 also 
provides references for earlier techniques that may also be used. 

Thus ONA are synthesised in situ on the surface of the substrate. However, DNA may 
also be printed directly onto the substrate using for example robotic devices equipped 
with either pins or piezo electric devices. 

The plurality of polynucleotide sequences are typically immobilised onto or in discrete 
regions of a solid substrate. The substrate are porous to allow immobilisation within the 
substrate or substantially non-porous, in which case the library sequences are typically 
immobilised on the surface of the substrate. The solid substrate are made of any material 
to which polypeptides can bind, either directly or indirectly. Examples of suitable solid 
substrates include flat glass, silicon wafers, mica, ceramics and organic polymers such 
as plastics, including polystyrene and polymethacrylate. It may also be possible to use 
semi-permeable membranes such as nitrocellulose or nylon membranes, which are 
widely available. The semi-permeable membranes are mounted on a more robust solid 
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surface such as glass. The surfaces may optionally be coated with a layer of metal, such 
as gold, platinum or other transition metal. A particular example of a suitable solid 
substrate is the commercially available BIACore™ chip (Pharmacia Biosensors). 

5 Preferably, the solid substrate is generally a material having a rigid or semi-rigid surface. 
In preferred embodiments, at least one surface of the substrate will be substantially flat, 
although in some embodiments it are desirable to physically separate synthesis regions 
for different polymers with, for example, raised regions or etched trenches. It is also 
preferred that the solid substrate is suitable for the high density application of DNA 
10 sequences in discrete areas of typically from 50 to 100 pm, giving a density of 10000 to 
40000 cm" 2 . 

The solid substrate is conveniently divided up into sections. This are achieved by 
techniques such as photoetching, or by the application of hydrophobic inks, for example 
15 teflon-based inks (CeMine, USA). 

Discrete positions, in which each different member of the array is located may have any 
convenient shape, e.g., circular, rectangular, elliptical, wedge-shaped, etc. 

20 Attachment of the polynucleotide sequences to the substrate are by covalent or non- 
covalent means. The plurality of polynucleotide sequences are attached to the substrate 
via a layer of molecules to which the sequences bind. For example, the sequences are 
labelled with biotin and the substrate coated with avidin and/or streptavidin. A 
convenient feature of using biotinylated sequences is that the efficiency of coupling to the 

25 solid substrate are determined easily. Since the library sequences may bind only poorly 
to some solid substrates, it is often necessary to provide a chemical interface between 
the solid substrate (such as in the case of glass) and the sequences. Examples of 
suitable chemical interfaces include hexaethylene glycol. Another example is the use of 
polylysine coated glass, the polylysine then being chemically modified using standard 

30 procedures to introduce an affinity ligand. Other methods for attaching molecules to the 
surfaces of solid substrate by the use of coupling agents are known in the art, see for 
example W098/49557. 

The complete DNA array is typically read at the same time by charged coupled device 
35 (CCD) camera or confocal imaging system. Alternatively, the DNA array are placed for 
detection in a suitable apparatus that can move in an x-y direction, such as a plate 
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reader. In this way, the change in characteristics for each discrete position are 
measured automatically by computer controlled movement of the array to place each 
discrete element in turn in line with the detection means. 

The detection means are capable of interrogating each position in the library array 
optically or electrically. Examples of suitable detection means include CCD cameras or 
confocal imaging systems. 

In a preferred embodiment, the ievel of expression of the cancer-associated gene in the 
tesfcsample is determined by hybridizing a probe/primer to RNA in the test sample under 
at least low stringency hybridization conditions and detecting the hybridization using a 
detection means. 



Similarly, the level of mRNA in the comparable sample from the healthy or normal 
individual is preferably determined by hybridizing a probe/primer to RNA in said 
comparable sample under at least low stringency hybridization conditions and detecting 
the hybridization using a detection means. 

For the purposes of defining the level of stringency to be used in these diagnostic 
assays, a low stringency is defined herein as being a hybridization and/or a wash carried 
out in 6xSSC buffer, 0.1% (w/v) SDS at 28°C, or equivalent conditions. A moderate 
stringency is defined herein as being a hybridization and/or washing carried out in 2xSSC 
buffer, 0.1% (w/v) SDS at a temperature in the range 45°C to 65°C, or equivalent 
conditions. A high stringency is defined herein as being a hybridization and/or wash 
earned out in O.lxSSC buffer, 0.1% (w/v) SDS, or lower salt concentration, and at a 
temperature of at least 65°C, or equivalent conditions. Reference herein to a particular 
level of stringency encompasses equivalent conditions using wash/hybridization solutions 
other than SSC known to those skilled in the art 

Generally, the stringency is increased by reducing the concentration of SSC buffer, 
and/or increasing the concentration of SDS and/or increasing the temperature of the 
hybridization and/or wash. Those skilled in the art will be aware that the conditions for 
hybridization and/or wash may vary depending upon the nature of the hybridization 
matrix used to support the sample RNA, or the type of hybridization probe used. 
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In general, the sample or the probe is immobilized on a solid matrix or surface (e.g., 
nitrocellulose). For high throughput screening, the sample or probe will generally 
comprise an array of nucleic acids on glass or other solid matrix, such as, for example, 
as described in WO 96/17958. Techniques for producing high density arrays are 
5 described, for example, by Fodor et a/., Science 767-773, 1991, and in U.S. Pat No. 
5,143,854. Typical protocols for other assay formats can be found, for example in 
Current Protocols In Molecular Biology, Unit 2 (Northern Blotting), Unit 4 (Southern 
Blotting), and Unit 18 (PCR Analysis), Frederick M. Ausubul etai (ed)., 1995. 

10 The detection means according to this aspect of the invention may be any nucleic acid- 
based detection means such as, for example, nucleic acid hybridization or amplification 
reaction (eg. PCR), a nucleic acid sequence-based amplification (NASBA) system, 
inverse polymerase chain reaction (iPCR), in situ polymerase chain reaction, or reverse 
transcription polymerase chain reaction (RT-PCR), amongst others. 

15 

The probe can be labelled with a reporter molecule capable of producing an identifiable 
signal (e.g., a radioisotope such as 32 P or ^S, or a fluorescent or biotinylated molecule). 
According to this embodiment, those skilled in the art will be aware that the detection of 
said reporter molecule provides for identification of the probe and that, following the 
20 hybridization reaction, the detection of the corresponding nucleotide sequences in the 
sample is facilitated. Additional probes can be used to confirm the assay results obtained 
using a single probe. 

Wherein the detection means is an amplification reaction such as, for example, a 
25 polymerase chain reaction or a nucleic acid sequence-based amplification (NASBA) 
system or a variant thereof, one or more nucleic acid probes molecules of at least about 
20 contiguous nucleotides in length is hybridized to mRNA encoding a cancer-associated 
protein, or alternatively, hybridized to cDNA or cRNA produced from said mRNA, and 
nucleic acid copies of the template are enzymically-amplified. 

30 

Those skilled in the art will be aware that there must be a sufficiently high percentage of 
nucleotide sequence identity between the probes and the RNA sequences in the sample 
template molecule for hybridization to occur. As stated previously, the stringency 
conditions can be selected to promote hybridization. 

35 
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In one format, PGR provides for the hybridization of non-complementary probes to 
different strands of a double-stranded nucleic acid template molecule (ie. a DNA/RNA, 
RNA/RNA or DNA/DNA template), such that the hybridized probes are positioned to 
facilitate the 5 -to 3' synthesis of nucleic acid in the intervening region, under the control 
5 of a thermostable DNA polymerase enzyme. In accordance with this embodiment, one 
sense probe and one antisense probe as described herein would be used to amplify DNA 
from the hybrid RNA/DNA template or cDNA. 

In the present context the cDNA would generally be produced by reverse transcription of 
10 mRNA present in the sample being tested (ie. RT-PCR). RT-PCR is particularly useful 
when it is desirable to determine expression of a cancer-associated gene. It is also 
known to those skilled in the art to use m RNA/DNA hybrid molecules as a template for 
such amplification reactions, and, as a consequence, first strand cDNA synthesis is all 
that is required to be performed prior to the amplification reaction. 

15 

Variations of the embodiments described herein are described in detail by McPherson et 
a!., PCR: A Practical Approach, (series eds, D. Rickwood and B.D. Hames), IRL Press 
Limited, Oxford. pp1-253, 1991. 

20 The amplification reaction detection means described supra can be further coupled to a 
classical hybridization reaction detection means to further enhance sensitivity and 
specificity of the inventive method, such as by hybridizing the amplified DNA with a probe 
which is different from any of the probes used in the amplification reaction. 

25 Similarly, the hybridization reaction detection means described supra can be further 
coupled to a second hybridization step employing a probe which is different from the 
probe used in the first hybridization reaction. 

The comparison to be performed in accordance with the present invention may be a 
30 visual comparison of the signal generated by the probe, or alternatively, a comparison of 
data integrated from the signal, such as, for example, data that have been corrected or 
normalized to allow for variation between samples. Such comparisons can be readily 
performed by those skilled in the art. 
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Polypeptides 

Cancer-associated polypeptides are encoded by cancer-associated genes. It will be 
understood that such polypeptides include those polypeptide and fragments thereof that 
5 are homologous to the polypeptides encoded by the nucleotide sequences referred to in 
Tables 1-3, which are obtained from any source, for example related viral/bacterial 
proteins, cellular homologues and synthetic peptides, as well as variants or derivatives 
thereof. 

10 Thus, the present invention encompasses the use of variants, homologues or derivatives 
of the cancer-associated proteins descirbed in the accompanying Tables. In one 
embodiment, homologues are naturally occurring sequences, such as orthologues, tissue- 
specific isoforms and allelic variants. . 

15 In the context of the present invention, a homologous sequence is taken to include an 
amino acid sequence which is at least 60, 70, 80 or 90% identical, preferably at least 95 
or 98% identical at the amino acid level over at least 20, 40, 60 or 80 amino acids with a 
sequence encoded by a nucleotide sequence referred to in any one of Tables 1-3. In 
particular, homology should typically be considered with respect to those regions of the 

20 sequence known to be essential for specific biological functions rather than non-essential 
neighbouring sequences. 

Although amino acid homology can also be considered in terms of similarity (i.e. amino 
acid residues having similar chemical properties/functions), in the context of the present 
25 invention it is preferred to express homology in terms of sequence identity. 

Homology comparisons are carried out as described above for nucleotide sequences with 
the appropriate modifications for amino acid sequences. For example when using the GCG 
Wisconsin Bestfit package (see below) the default gap penalty for amino acid sequences is 
30 -12 for a gap and -4 for each extension. 

It should also be noted that where computer algorithms are used to align amino acid 
sequences, although the final % homology are measured in terms of identity, the 
alignment process itself is typically not based on an all-or-nothing pair comparison. 
35 Instead, a scaled similarity score matrix is generally used that assigns scores to each 
pairwise comparison based on chemical similarity or evolutionary distance. An example 
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of such a matrix commonly used is the BLOSUM62 matrix - the default matrix for the 
BLAST suite of programs. GCG Wisconsin programs generally use either the public 
default values or a custom symbol comparison table if supplied (see user manual for 
further details). It is preferred to use the public default values for the GCG package, or in 
5 the case of other software, the default matrix, such as BLOSUM62. 

The terms "variant" or "derivative" in relation to the amino acid sequences of the present 
invention includes any substitution of, variation of, modification of, replacement of, deletion 
of or addition of one (or more) amino acids from or to the sequence providing the resultant 
10 amino acid sequence preferably has biological activity, preferably having at least 25 to 50% 
of the activity as the polypeptides referred to in the sequence listings, more preferably at 
least substantially the same activity. Particular details of biological activity for each 
polypeptide are given in Tables 1-3. 

15 Thus, the polypeptides referred to in Tables 1-3 and homologues thereof, are modified 
for use in the present invention. Typically, modifications are made that maintain the 
activity of the sequence. Thus, in one embodiment, amino acid substitutions are made, 
for example from 1, 2 or 3 to 10, 20 or 30 substitutions provided that the modified 
sequence retains at least about 25 to 50% of, or substantially the same activity. 

20 However, in an alternative preferred embodiment, modifications to the amino acid 
sequences of a cancer-associated protein are made intentionally to reduce the biological 
activity of the polypeptide. For example truncated polypeptides that remain capable of 
binding to target molecules but lack functional effector domains are useful as inhibitors of 
the biological activity of the full length molecule. 

25 

In general, preferably less than 20%, 10% or 5% of the amino acid residues of a variant or 
derivative are altered as compared with the corresponding region of the polypeptides 
referred to in Tables 1-3. 

30 Amino acid substitutions may include the use of non-naturally occurring analogues, for 
example to increase blood plasma half-life of a therapeutically administered polypeptide 
(see below for further details on the production of peptide derivatives for use in therapy). 

Conservative substitutions are made, for example according to the Table below. Amino 
35 acids in the same block in the second column and preferably in the same line in the third 
column are substituted for each other 
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Cancer-associated proteins also include fragments of the above mentioned full length 
polypeptides and variants thereof, including fragments of the sequences referred to in 
Tables 1-3 and homologues thereof. Preferred fragments include those which include an 
epitope. Suitable fragments will be at least about 6 or 8, e.g. at least 10, 12, 15 or 20 amino 
acids in length. They may also be less than 200, 100 or 50 amino acids in length. 
Polypeptide fragments may contain one or more (e.g. 2, 3, 5, or 10) substitutions, deletions 
or insertions, including conserved substitutions. Where substitutions, deletion and/or 
insertions have been made, for example by means of recombinant technology, preferably 
less than 20%. 10% or 5% of the amino acid residues depicted in the sequence listings are 
altered. 



15 



20 



Cancer-associated proteins are preferably in a substantially isolated form. It will be 
understood that the protein are mixed with carriers or diluents which will not interfere with 
the intended purpose of the protein and still be regarded as substantially isolated. A 
cancer-associated protein of the invention may also be in a substantially purified form, in 
which case it will generally comprise the protein in a preparation in which more than 
90%, e.g. 95%, 98% or 99% pure as determined by SDS/PAGE or other art-recognized 
means for asessing protein purity. 



25 



Protein Production 

For producing full-length polypeptides or immunologically active derivatives thereof by 
recombinant means, a protein-encoding region comprising at least about 15 contiguous 
nucleotides of the protein-encoding region of a nucleic acid referred to in any one of 
Tables 1-3 is placed in operable connection with a promoter or other regulatory 
sequence capable of regulating expression in a cell-free system or cellular system. 
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Reference herein to a "promoter" is to be taken in its broadest context and includes the j 
transcriptional regulatory sequences of a classical genomic gene, including the TATA 
box which is required for accurate transcription initiation, with or without a CCAAT box 
sequence and additional regulatory elements (i.e., upstream activating sequences, 
5 enhancers and silencers) which alter, gene expression in response to developmental 
and/or external stimuli, or in a tissue-specific manner. In the present context, the term 
"promoter is also used to describe a recombinant, synthetic or fusion molecule, or. 
derivative which confers, activates or enhances the expression of a nucleic acid molecule 
to which it is operably connected,, and which encodes the polypeptide or peptide 
10 fragment. Preferred promoters can contain additional copies of one or more specific 
regulatory elements to further enhance expression and/or to alter the spatial expression 
and/or temporal expression of the said nucleic acid molecule. 

Placing a nucleic acid molecule under the regulatory control of, i.e., "in operable 
15 connection with", a promoter sequence means positioning said molecule such that 
expression is controlled by the promoter sequence. Promoters are generally positioned 
5' (upstream) to the coding sequence that they control. To construct heterologous 
promoter/structural gene combinations, it is generally preferred to position the promoter 
at a distance from the gene transcription start site that is approximately the same as the 
20 distance between that promoter and the gene it controls in its natural setting, i.e., the 
gene from which the promoter is derived. Furthermore, the regulatory elements 
comprising a promoter are usually positioned within 2 kb of the start site of transcription 
of the gene. As is known in the art, some variation in this distance can be accommodated 
without loss of promoter function. Similarly, the preferred positioning of a regulatory 
25 sequence element with respect to a heterologous gene to be placed under its control is 
defined by the positioning of the element in its natural setting, i.e., the genes from which 
it is derived. Again, as is known in the art, some variation in this distance can also occur. 

The prerequisite for producing intact polypeptides and peptides in bacteria such as E 
30 coli is the use of a strong promoter with an effective ribosome binding site. Typical 
promoters suitable for expression in bacterial cells such as £ coli include, but are not 
limited to, the lacz promoter, temperature-sensitive \ or Ar promoters, T7 promoter or 
the IPTG-inducible tac promoter. A number of other vector systems for expressing the 
nucleic acid molecule of the invention in E coli are well-known in the art and are 
35 described, for example, in Ausubel et al (In: Current Protocols in Molecular Biology. 
Wiley Interscience, ISBN 047150338, 1987) or Sambrook et al (In: Molecular cloning. A 



: J i \ 

WO 2004/022778 PCT/AU2003/001166 

65 

laboratory manual, second edition, Cold Spring Harbor Laboratory, Cold Spring Harbor, 
N.Y., 1989). Numerous plasmids with suitable promoter sequences for expression in 
bacteria and efficient ribosome binding sites have been described, such as for example, 
pKC30 (A, L : Shimatake and Rosenberg, Nature 292, 128, 1981); pKK173-3 (tac. Amann 
and Brosius, Gene 40, 183, 1985), pET-3 (77: Studier and Moffat, J. Mol. Biol. 189, 113, 
1986); the pBAD/TOPO or pBAD/Thio-TOPO series of vectors containing an arabinose- 
inducible promoter (Invitrogen, Carlsbad, CA), the latter of which is designed to also 
produce fusion proteins with thioredoxin to enhance solubility of the expressed protein; 
the pFLEX series of expression vectors (Pfizer Inc., CT, USA); or the pQE series of 
expression vectors (Qiagen, CA), amongst others. 

Typical promoters suitable for expression in viruses of eukaryotic cells and eukaryotic 
cells include the SV40 late promoter, SV40 early promoter and cytomegalovirus (CMV) 
promoter, CMV IE (cytomegalovirus immediate early) promoter amongst others. 
Preferred vectors for expression in mammalian cells (eg. 293, COS, CHO, 293T cells) 
include, but are not limited to, the pcDNA Vector suite supplied by Invitrogen, in particular 
pcDNA 3.1 myc-His-tag comprising the CMV promoter and encoding a C-terminal 6xHis 
and MYC tag; and the retrovirus vector pSRatkneo (Muller et a/., Mol. Cell. Biol., 11, 
1785, 1991). The vector pcDNA 3.1 myc-His (Invitrogen) is particularly preferred for 
expressing a secreted form of a protein in 293T cells, wherein the expressed peptide or 
protein can be purified tree of conspecffic proteins, using standard affinity techniques that 
employ a Nickel column to bind the protein via the His tag. 

A wide range of additional host/vector systems suitable for expressing polypeptides or 
immunological derivatives thereof are available publicly, and described, for example, in 
Sambrook et al (In: Molecular cloning. A laboratory manual, second edition, Cold 
Spring Harbor Laboratory, Cord Spring Harbor, N.Y., 1 989). 

Means for introducing the isolated nucleic acid molecule or a gene construct comprising 
same into a cell for expression are well-known to those skilled in the art. The technique 
used for a given organism depends on the known successful techniques. Means for 
introducing recombinant DNA into animal cells include microinjection, transfectJon 
mediated by DEAE-dextran, transfection mediated by liposomes such as by using 
lipofectamine (Gibco, MD, USA) and/or cellfectin (Gibco, MD, USA), PEG-mediated DNA 
uptake, electroporation and microparticle bombardment such as by using DNA-coated 
tungsten or gold particles (Agracetus Inc., Wl, USA) amongst others. 
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For producing mutants, nucleotide insertion derivatives of the protein-encoding region 
are produced by making 5' and 3' terminal fusions, or by making intra-sequence 
insertions of single or multiple nucleotides or nucleotide analogues. Insertion nucleotide 
sequence variants are produced by introducing one or more nucleotides or nucleotide 
analogues into a predetermined site in the nucleotide sequence of said sequence, 
although random insertion is also possible with suitable screening of the resulting product 
being performed. Deletion variants are produced by removing one or more nucleotides 
from the nucleotide sequence. Substitutional nucleotide variants are produced by 
substituting at least one nucleotide in the sequence with a different nucleotide or a 
nucleotide analogue in its place, with the immunologically active derivative encoded 
therefor having an identical amino acid sequence ■, or only a limited number of amino 
acid modifications that do not alter its antigenicity compared to the base peptide or its 
ability to bind antibodies prepared against the base peptide. Such mutant derivatives will 
preferably have at least 80% identity with the base amino acid sequence from which they 
are derived. 

Preferred immunologically active derivatives of a full-length polypeptide encoded by a 
gene referred to in any one of Tables 1-3 will comprise at least about 5-10 contiguous 
amino acids of the full-length amino acid sequence, more preferably at least about 10-20 
contiguous amino acids in length, and even more preferably 20-30 contiguous amino 
acids in length. 

For the purposes of producing derivatives using standard peptide synthesis techniques, 
such as, for example, Fmoc chemistry, a length not exceeding about 30-50 amino acids 
in length is preferred, as longer peptides are difficult to produce at high efficiency. 
Longer peptide fragments are readily achieved using recombinant DNA techniques 
wherein the peptide is expressed in a cell-free or cellular expression system comprising 
nucleic acid encoding the desired peptide fragment. 

It will be apparent to the skilled artisan that any sufficiently antigenic region of at least 
about 5-10 amino acid residues can be used to prepare antibodies that bind generally to 
the polypeptides listed in Tables 1-3. 

An expressed protein or synthetic peptide is preferably produced as a recombinant fusion 
protein, such as for example, to aid in extraction and purification. To produce a fusion 
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polypeptide, the open reading frames are covalently linked in the same reading frame, 
such as, for example, using standard cloning procedures as described by Ausubel et at. 
(Current Protocols in Molecular Biology, Wiley Interscience, ISBN 047150338, 1992), 
and expressed under control of a promoter. Examples of fusion protein partners include 
5 glutathione-S-transferase (GST). FLAG, hexahistidine, GAL4 (DNA binding and/or 
transcriptional activation domains) and p-galactosidase. It may also be convenient to 
include a proteolytic cleavage site between the fusion protein partner and the protein 
sequence of interest to allow removal of fusion protein sequences. Preferably the fusion 
protein will not hinder the immune function of the target protein. 

10 

In a particularly preferred embodiment, polypeptides are produced substantially free of 
conspecific proteins. Such purity can be assessed by standard procedures, such as, for 
example, SDS/polyacrylamide gel electrophoresis, 2-dimensional gene electrophoresis, 
chromatography, amino acid composition analysis, or amino acid sequence analysis. 

15 

To produce isolated polypeptides or fragments, eg., for antibody production, standard 
protein purification techniques may be employed. For example, gel filtration, ion 
exchange chromatography, reverse phase chromatography, or affinity chromatography, 
or a combination of any one or more said procedures, may be used. High pressure and 

20 low pressure procedures can also be employed, such as, for example, FPLC, or HPLC. 
To isolate the full-length proteins or peptide fragments comprising more than about 50- 
100 amino acids in length, it is particularly preferred to express the polypeptide in a 
suitable cellular expression system in combination with a suitable affinity tag, such as a 
6xHis tag, and to purify the polypeptide using an affinity step that bonds it via the tag 

25 (supra). Optionally, the tag may then be cleaved from the expressed polypeptide. 

Alternatively, for short immunologically active derivatives of a full-length polypeptide, 
preferably those peptide fragments comprising less than about 50 amino acids in length, 
chemical synthesis techniques are conveniently used. As will be known to those skilled 
30 in the art, such techniques may also produce contaminating peptides that are shorter 
than the desired peptide, in which case the desired peptide is conveniently purified using 
reverse phase and/or ion exchange chromatography procedures at high pressure (ie. 
HPLC or FPLC). 

35 Antibodies 
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The invention also provides monoclonal or polyclonal antibodies that bind specifically to 
polypeptides of the invention or fragments thereof. Thus, the present invention further 
provides a process for the production of monoclonal or polyclonal antibodies to polypeptides 
of the invention. 

5 

The phrase "binds specifically" to a polypeptide means that the binding of the antibody to 
the protein or peptide is determinative of the presence of the protein, in a heterogeneous 
population of proteins and other biologies. Thus, under designated immunoassay 
conditions, the specified antibodies bind to a particular protein at least two times the 
10 background and more typically more than 10 to 100 times background. Typically, 
antibodies of the invention bind to a protein of interest with a Kd of at least about 0.1 mM, 
more usually at least about 1 ^iM, preferably at least about 0.1 *iM, and most preferably at 
least, 0.01 nM. 

15 Reference herein to antibody or antibodies includes whole polyclonal and monoclonal 
antibodies, and parts thereof, either alone or conjugated with other moieties. Antibody 
parts include Fab and F(ab) 2 fragments and single chain antibodies. The antibodies may 
be made in vivo in suitable laboratory animals, or, in the case of engineered antibodies 
(Single Chain Antibodies or SCABS, etc) using recombinant DNA techniques in vitro. 

20 

In accordance with this aspect of the invention, the antibodies may be produced for the 
purposes of immunizing the subject, in which case high titer or neutralizing antibodies 
that bind to a B cell epitope will be especially preferred. Suitable subjects for 
immunization will, of course, depend upon the immunizing antigen or antigenic B cell 

25 epitope. It is contemplated that the present invention will be broadly applicable to the 
immunization of a wide range of animals, such as, for example, farm animals (e.g. 
horses, cattle, sheep, pigs, goats, chickens, ducks, turkeys, and the like), laboratory 
animals (e.g. rats, mice, guinea pigs, rabbits), domestic animals (cats, dogs, birds and 
the like), feral or wild exotic animals (e.g. possums, cats, pigs, buffalo, wild dogs and the 

30 like) and humans. 

Alternatively, the antibodies may be for commercial or diagnostic purposes, in which 
case the subject to whom the diagnostic/prognostic protein or immunogenic fragment or 
epitope thereof is administered will most likely be a laboratory or farm animal. A wide 
35 range of animal species are used for the production of antisera. Typically the animal 
used for production of antisera is a rabbit, a mouse, rat, hamster, guinea pig, goat, 
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sheep, pig, dog, horse, or chicken. Because Of the relatively large blood volume of 
rabbits, a rabbit is a preferred choice for production of polyclonal antibodies. However, 
as will be known to those skilled in the art, larger amounts of immunogen are required to 
obtain high antibodies from large animals as opposed to smaller animals such as mice. 
In such cases, it will be desirable to isolate the antibody from the immunized animal. 

Preferably, the antibody is a high titer antibody. By "high titer" means a sufficiently high 
titer to be suitable for use in diagnostic or therapeutic applications. As will be known in 
the art, there is some variation in what might be considered "high titer". For most 
applications a titer of at least about 10 3 -10 4 is preferred. More preferably, the antibody 
titer will be in the range from about 10 4 to about 10 5 , even more preferably in the range 
from about 1 0 5 to about 1 0 6 . 

More preferably, in the case of B cell epitopes from pathogens, viruses or bacteria, the 
antibody is a neutralizing antibody (i.e. it is capable of neutralizing the infectivity of the 
organism fro which the B cell epitope is derived). 



To generate antibodies, the diagnostic/prognostic protein or immunogenic fragment or 
epitope thereof, optionally formulated with any suitable or desired carrier, adjuvant, BRM, 

20 or pharmaceutical^ acceptable excipient, is conveniently administered in the form of an 
injectable composition. Injection, may be intranasal, intramuscular, sub-cutaneous, 
intravenous, intradermal, intraperitoneal, or by other known route. For intravenous 
injection, it is desirable to include one or more fluid and nutrient replenishes. Means for 
preparing and characterizing antibodies are well known in the art (See, e.g., 

25 ANTIBODIES: A LABORATORY MANUAL, Cold Spring Harbor Laboratory, 1988, 
incorporated herein by reference). 



The efficacy of the diagnostic/prognostic protein or immunogenic fragment or epitope 
thereof in producing an antibody is established by injecting an animal, for example, a 

30 mouse, rat, rabbit, guinea pig, dog, horse, cow, goat or pig, with a formulation comprising 
the diagnostic/prognostic protein or immunogenic fragment or epitope thereof, and then 
monitoring the immune response to the B cell epitope, as described in the Examples. 
Both primary and secondary immune responses are monitored. The antibody titer is 
deteimined using any conventional immunoassay, such as, for example, ELISA, or radio 

35 immunoassay. 
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The production of polyclonal antibodies may be monitored by sampling blood of the 
immunized animal at various points following immunization. A second, booster injection, 
may be given, if required to achieve a desired antibody titer. The process of boosting 
and titering is repeated until a suitable titer is achieved. When a desired level of 
5 immunogenicity is obtained, the immunized animal is bled and the serum isolated and 
stored, and/or the animal is used to generate monoclonal antibodies (Mabs). 

For the production of monoclpnal antibodies (Mabs) any one of a number of well-known 
techniques may be used, such as, for example, the procedure exemplified in US Patent 
10 No. 4, 1 96,265, incorporated herein by reference. 

For example, a suitable animal will be immunized with an effective amount of the 
diagnostic/prognostic protein or immunogenic fragment or epitope thereof under 
conditions sufficient to stimulate antibody producing cells. Rodents such as mice and 
15 rats are preferred animals, however, the use of rabbit sheep, or frog cells is also 
possible. The use of rats may provide certain advantages, but mice are preferred, with 
the BALB/c mouse being most preferred as the most routinely used animal and one that 
generally gives a higher percentage of stable fusions. 

20 Following immunization, somatic cells with the potential for producing antibodies, 
specifically B lymphocytes (B cells), are selected for use in the MAb generating protocol. 
These cells may be obtained from biopsied spleens, tonsils or lymph nodes, or from a 
peripheral blood sample. Spleen cells and peripheral blood cells are preferred, the former 
because they are a rich source of antibody-producing cells that are in the dividing 

25 plasmablast stage, and the latter because peripheral blood is easily accessible. Often, a 
panel of animals will have been immunized and the spleen of animal with the highest 
antibody titer removed. Spleen lymphocytes are obtained by homogenizing the spleen 
with a syringe. Typically, a spleen from an immunized mouse contains approximately 5 x 
1 0 7 to 2 x 1 0 8 lymphocytes. 

30 

The B cells from the immunized animal are then fused with cells of an immortal myeloma 
cell, generally derived from the same species as the animal that was immunized with the 
diagnostic/prognostic protein or immunogenic fragment or epitope thereof. Myeloma cell 
lines suited for use in hybridoma-producing fusion procedures preferably are non- 
35 antibody-producing, have high fusion efficiency and enzyme deficiencies that render 
them incapable of growing in certain selective media which support the growth of only the 
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desired fused cells, or hybridomas. Any one of a number of myeloma cells may be used 
and these are known to those of skill in the art (e.g. murine P3-X63/Ag8, X63-Ag8.653, 
NS1/1.Ag 4 1, Sp210-Agl4, FO, NSO/U, MPC-11, MPC1 1-X45-GTG 1.7 and 
S194/5XX0; or rat R210.RCY3, Y3-Ag 1.2.3, IR983F and 4B210; and U-266, GM1500- 
GRG2, LICR-LON-HMy2 and UC729-6). A preferred murine myeloma cell is the NS-1 
myeloma cell line (also termed P3-NS-1-Ag4-1), which is readily available from the 
NIGMS Human Genetic Mutant Cell Repository under Accession No. GM3573. 
Alternatively, a murine myeloma SP2/0 non-producer cell line that is 8-azaguanine- 
resistant is used. 

To generate hybrids of antibody-producing spleen or lymph node cells and myeloma 
cells, somatic cells are mixed with myeloma cells in a proportion between about 20:1 to 
about 1:1, respectively, in the presence of an agent or agents (chemical or electrical) that 
promote the fusion of cell membranes. Fusion methods using Sendai virus have been 
described by Kohler and Milstein, Nature 256, 495-497, 1975; and Kohler and Miistein, 
Eur. J. Immunol. 6, 511-519, 1976. Methods using polyethylene glycol (PEG), such as 
37% (v/v) PEG, are described in detail by Gefter et a/., Somatic Cell Genet. 3, 231-236, 
1977. The use of electrically induced fusion methods is also appropriate. 

Hybrids are amplified by culture in a selective medium comprising an agent that blocks 
the de novo synthesis of nucleotides in the tissue culture media. Exemplary and 
preferred agents are aminopterin, methotrexate and azaserine. Aminopterin and 
methotrexate block de novo synthesis of both purines and pyrimidines, whereas 
azaserine blocks only purine synthesis. Where aminopterin or methotrexate is used, the 
media is supplemented with hypoxanthine and thymidine as a source of nucleotides 
(HAT medium). Where azaserine is used, the media is supplemented with hypoxanthine. 

The preferred selection medium is HAT, because only those hybridomas capable of 
operating nucleotide salvage pathways are able to survive in HAT medium, whereas 
myeloma cells are defective in key enzymes of the salvage pathway, (e.g., hypoxanthine 
phosphoribosyl transferase or. HPRT), and they cannot survive. B cells can operate this 
salvage pathway, but they have a limited life span in culture and generally die within 
about two weeks. Accordingly, the only cells that can survive in the selective media are 
those hybrids formed from myeloma and B cells. 
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The amplified hybridomas are subjected to a functional selection for antibody specificity 
and/or titer, such as, for example, by immunoassay (e.g. radioimmunoassay, enzyme 
immunoassay, cytotoxicity assay, plaque assay, dot immunobinding assay, and the like). 

The selected hybridomas are serially diluted and cloned into individual antibody- 
producing cell lines, which clones can then be propagated indefinitely to provide MAbs. 
The cell lines may be exploited for MAb production in two basic ways. A sample of the 
hybridoma is injected, usually in the peritoneal cavity, into a histocompatible animal of 
the type that was used to provide the somatic and myeloma cells for the original fusion. 
The -injected animal develops tumors secreting the specific monoclonal antibody 
produced by the fused cell hybrid. The body fluids of the animal, such as serum or 
ascites fluid, can then be tapped to provide MAbs in high concentration. The individual 
cell lines could also be cultured in vitro, where the MAbs are naturally secreted into the 
culture medium from which they are readily obtained in high concentrations. MAbs 
produced by either means may be further purified, if desired, using filtration, 
centrifugation and various chromatographic methods such as HPLC or affinity 
chromatography. 

Monoclonal antibodies of the present invention also include anti-idiotypic antibodies 
produced by methods well-known in the art. Monoclonal antibodies according to the 
present invention also may be monoclonal heteroconjugates, (i.e., hybrids of two or more 
antibody molecules). In another embodiment, monoclonal antibodies according to the 
invention are chimeric monoclonal antibodies. In one approach, the chimeric monoclonal 
antibody is engineered by cloning recombinant DNA containing the promoter, leader, and 
variable-region sequences from a mouse anti-PSA producing cell and the constant- 
region exons from a human antibody gene. The antibody encoded by such a 
recombinant gene is a mouse-human chimera. Its antibody specificity is determined by 
the variable region derived from mouse sequences. Its isotype, which is determined by 
the constant region, is derived from human DNA. 

In another embodiment, the monoclonal antibody according to the present invention is a 
"humanized" monoclonal antibody, produced by any one of a number of techniques well- 
known in the art That is, mouse complementary determining regions ("CDRs") are 
transferred from heavy and light V-chains of the mouse Ig into a human V-domain, 
followed by the replacement of some human residues in the framework regions of their 
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murine counterparts. "Humanized" monoclonal antibodies in accordance with this 
invention are especially suitable for use in vivo in diagnostic and therapeutic methods. 

As stated above, the monoclonal antibodies and fragments thereof according to this 
invention are multiplied according to in vitro and in vivo methods well-known in the art. 
Multiplication in vitro is carried out in suitable culture media such as Dulbecco's modified 
Eagle medium or RPMI 1640 medium, optionally replenished by a mammalian serum 
such as fetal calf serum or trace elements and growth-sustaining supplements, e.g., 
feeder cells, such as normal mouse peritoneal exudate cells, spleen cells, bone marrow 
macrophages or the like. In vitro production provides relatively pure antibody 
preparations and allows scale-up to give large amounts of the desired antibodies. 
Techniques for large scale hybridoma cultivation under tissue culture conditions are 
known in the art and include homogenous suspension culture, (e.g., in an airlift reactor or 
in a continuous stirrer reactor or immobilized or entrapped cell culture). 

Large amounts of the monoclonal antibody of the present invention also may be obtained 
by multiplying hybridoma cells in vivo. Cell clones are injected into mammals which are 
histocompatible with the parent cells, (e.g., syngeneic mice, to cause growth of antibody- 
producing tumors. Optionally, the animals are primed with a hydrocarbon, especially oils 
such as Pristane (tetramethylpentadecane) prior to injection. 

In accordance with the present invention, fragments of the monoclonal antibody of the 
invention are obtained from monoclonal antibodies produced as described above, by 
methods which include digestion with enzymes such as pepsin or papain and/or 
cleavage of disulfide bonds by chemical reduction. Alternatively, monoclonal antibody 
fragments encompassed by the present invention are synthesized using an automated 
peptide synthesizer, or they may be produced manually using techniques well known in 
the art. 

The monoclonal conjugates of the present invention are prepared by methods known in 
the art, e.g., by reacting a monoclonal antibody prepared as described above with, for 
instance, an enzyme in the presence of a coupling agent such as glutaraldehyde or 
periodate. Conjugates with fluorescein markers are prepared in the presence of these 
coupling agents, or by reaction with an Isothiocyanate. Conjugates with metal chelates 
are similarly produced. Other moieties to which antibodies may be conjugated include 
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radionuclides such as, for example; 3 H, 125 l, *P, .^S, 14 C, 51 Cr, ^Cl, ™Co, 58 Co, 59 Fe, 
75 Se, and 152 Eu. 

Radioactively labeled monoclonal antibodies of the present invention are produced 
5 according to well-known methods in the art For instance, monoclonal antibodies are 
iodinated by contact with sodium or potassium iodide and a chemical oxidizing agent 
such as sodium hypochlorite, or an enzymatic oxidizing agent, such as lactoperoxidase. 
Monoclonal antibodies according to the invention may be labeled with technetium" by 
ligand exchange process, for example, by reducing pertechnetate with stannous solution, 
10 chelating the reduced technetium onto a Sephadex column and applying the antibody to 
this column or by direct labeling techniques, (e.g., by incubating pertechnate, a reducing 
agent such as SNCfe, a buffer solution such as sodium-potassium phthalate solution, and 
the antibody). 

15 Any immunoassay may be used to monitor antibody production by the 
diagnostic/prognostic protein or immunogenic fragment or epitope thereof . 
Immunoassays, in their most, simple and direct sense, are binding assays. Certain 
preferred immunoassays are the various types of enzyme linked immunosorbent assays 
(ELlSAs) and radioimmunoassays (RIA) known in the art. Immunohistochemical 

20 detection using tissue sections is also particularly useful. However, it will be readily 
appreciated that detection is not limited to such techniques, and Western blotting, dot 
blotting, FACS analyses, and the like may also be used. 

Most preferably, the assay will be capable of generating quantitative results. 

25 

For example, antibodies are tested in simple competition assays. A known antibody 
preparation that binds to the B cell epitope and the test antibody are incubated with an 
antigen composition comprising the B cell epitope, preferably in the context of the native 
antigen. "Antigen composition" as used herein means any composition that contains 

30 some version of the B cell epitope in an accessible form. Antigen-coated wells of an 
ELISA plate are particularly preferred. In one embodiment, one would pre-mix the known 
antibodies with varying amounts of the test antibodies (e.g., 1:1, 1:10 and 1:100) for a 
period of time prior to applying to the antigen composition. If one of the known antibodies 
is labeled, direct detection of the label bound to the antigen is possible; comparison to an 

35 unmixed sample assay will determine competition by the test antibody and, hence, cross- 
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reactivity. Alternatively, using secondary antibodies specific for either the known or test 
antibody, one will be able to determine competition. 

An antibody that binds to the antigen composition will be able to effectively compete for 
5 binding of the known antibody and thus will significantly reduce binding of the latter. The 
reactivity of the known antibodies in the absence of any test antibody is the control. A 
significant reduction in reactivity in the presence of a test antibody is indicative of a test 
antibody that binds to the B ceil epitope (i.e., it cross-reacts with the known antibody). 

10 In one exemplary ELISA, the antibodies against the diagnostic/prognostic protein or 
immunogenic fragment or B cell epitope are immobilized onto a selected surface 
exhibiting protein affinity, such as a well in a polystyrene microtiter plate. Then, a 
composition containing a peptide comprising the B cell epitope is added to the wells. 
After binding and washing to remove non-specifically bound immune complexes, the 

15 bound epitope may be detected. Detection is generally achieved by the addition of a 
second antibody that is known to bind to the B cell epitope and is linked to a detectable 
label. This type of ELISA is a simple "sandwich ELISA". Detection may also be achieved 
by the addition of said second antibody, followed by the addition of a third antibody that 
has binding affinity for the second antibody, with the third antibody being linked to a 

20 detectable label. 

Antibodies of the invention may be bound to a solid support and/or packaged into kits in 
a suitable container along with suitable reagents, controls, instructions and the like. 

25 Immunoassay formats 

In one embodiment, a cancer-associated protein or an immunogenic fragment or epitope 
thereof is detected in a patient sample, wherein the level of the protein or immunogenic 
fragment or epitope in the sample is indicative of ovarian cancer or disease recurrence or 
an indicator of poor survival. Preferably, the method comprises contacting a biological 

30 sample derived from the subject with an antibody capable of binding to a cancer- 
associated protein or an immunogenic fragment or epitope thereof, and detecting the 
formation of an antigen-antibody complex. 

In another embodiment, an antibody against a cancer-associated protein or epitope 
35 thereof is detected in a patient sample, wherein the level of the antibody in the sample is 
indicative of ovarian cancer or disease recurrence or an indicator of poor survival. 
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Preferably, the method comprises contacting a biological sample derived from the 
subject with a cancer-associated protein or an antigenic fragment eg., a B cell epitope or 
other immunogenic fragment thereof, and detecting the formation of an antigen-antibody 
complex. 

The diagnostic assays of the invention are useful for determining the progression of 
ovarian cancer or a metastasis thereof in a subject. In accordance with these prognostic 
applications of the invention, the level of a cancer-associated protein or an immunogenic 
fragment or epitope thereof in a biological sample is correlated with the disease state eg., 
as determined by clinical symptoms or biochemical tests (eg., CA125 levels). 

Accordingly, a further embodiment of the invention provides a method for detecting a 
cancer cell in a subject, said method comprising: 

(i) determining the level of a cancer-associate protein in a test sample from said 
subject; and 

(ii) comparing the level determined at (i) to the level of said cancer-associated 
protein in a comparable sample from a healthy or normal individual, 

wherein a level of said cancer-associate protein at (i) that is modified in the test sample 
relative to the comparable sample from the normal or healthy individual is indicative of 
the presence of a cancer cell in said subject 

In one embodiment of the diagnostic/prognostic methods described herein, the biological 
sample is obtained previously from the subject. In accordance with such an 
embodiment, the prognostic or diagnostic method is performed ex vivo. 

In yet another embodiment, the subject diagnostic/prognostic methods further comprise 
processing the sample from the subject to produce a derivative or extract that comprises 
the analyte. 

Preferred detection systems contemplated herein include any known assay for detecting 
proteins or antibodies in a biological sample isolated from a human subject, such as, for 
example, SDS/PAGE, isoelectric focussing, 2-dimensional gel electrophoresis 
comprising SDS/PAGE and isoelectric focussing, an immunoassay, a detection based 
system using an antibody or non-antibody ligand of the protein, such as, for example, a 
small molecule (e.g. a chemical compound, agonist, antagonist, allosteric modulator, 
competitive inhibitor, or non-competitive inhibitor, of the protein). In accordance with 
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these embodiments, the antibody or small molecule may be used in any standard solid 
phase or solution phase assay format amenable to the detection of proteins. Optical or 
fluorescent detection, such as, for example, using mass spectrometry, MALDI-TOF, 
biosensor technology, evanescent fiber optics, or fluorescence resonance energy 
5 transfer, is clearly encompassed by the present invention. Assay systems suitable for 
use in high throughput screening of mass samples, particularly a high throughput 
spectroscopy resonance, method (e.g. MALDI-TOF, electrospray MS or nano- 
electrospray MS), are particularly contemplated. 

10 Immunoassay formats are particularly preferred, eg., selected from the group consisting 
of, an immunoblot, a Western blot, a dot blot, an enzyme linked immunosorbent assay 
(ELISA), radioimmunoassay (RIA), enzyme immunoassay. Modified immunoassays 
utilizing fluorescence resonance energy transfer (FRET), isotope-coded affinity tags 
(ICAT), matrix-assisted laser desorption/ionization time of flight (MALDI-TOF), 

15 electrospray ionization (ESI), biosensor technology, evanescent fiber-optics technology 
or protein chip technology are also useful. 

Preferably, the assay is a semi-quantitative assay or quantitative assay. 

20 Standard solid phase ELISA formats are particularly useful in determining the 
concentration of a protein or antibody from a variety of patient samples. 

In one form such as an assay involves immobilising a biological sample comprising 
antibodies against the cancer-associated protein or epitope, or alternatively an ovarian 
25 cancer-associated protein or an immunogenic fragment thereof, onto a solid matrix, such 
as, for example a polystyrene or polycarbonate microwell or dipstick, a membrane, or a 
glass support (e.g. a glass slide). 

In the case of an antigen-based assay, an antibody that specifically binds an ovarian 
30 cancer-associated protein is brought into direct contact with the immobilised biological 
sample, and forms a direct bond with any of its target protein present in said sample. For 
an antibody-based assay, an immobilized ovarian cancer-associated protein or an 
immunogenic fragment or epitope thereof is contacted with the sample. The added 
antibody or protein in solution is generally labelled with a detectable reporter molecule, . 
35 such as for example, a fluorescent label (e.g. FITC or Texas Red) or an enzyme (e.g. 
horseradish peroxidase (HRP)), alkaline phosphatase (AP) or p-galactosidase. 
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Alternatively, or in addition, a second labelled antibody can be used that binds to the firet 
antibody or to the isolated/recombinant antigen. Following washing to remove any 
unbound antibody or antigen, as appropriate, the label is detected either directly, in the 
case of a fluorescent label, or through the addition of a substrate, such as for example 
5 hydrogen peroxide, TMB, or toluidine, or 5-bromo-4-chloro<3-indol-beta-I> 
galaotopyranoside (x-gai). 

Such ELISA based systems are particularly suitable for quantification of the amount of a 
protein or antibody in a sample, such as, for example, by calibrating the detection system 
10 against known amounts of a standard. 

In another form, an ELISA consists of immobilizing an antibody that specifically binds an 
ovarian cancer-associated protein on a solid matrix, such as, for example, a membrane, 
a polystyrene or polycarbonate microwell, a polystyrene or polycarbonate dipstick or a 
15 glass support. A patient sample is then brought into physical relation with said antibody, 
and the antigen in the sample is bound or 'captured'. The bound protein can then be 
detected using a labelled antibody. For example if the protein is captured from a human 
sample, an anti-human antibody is used to detect the captured protein. Alternatively, a 
third labelled antibody can be used that binds the second (detecting) antibody. 

20 

It will be apparent to the skilled person that the assay formats described herein are 
amenable to high throughput formats, such as, for example automation of screening 
processes, or a microarray format as described in Mendoza et a/, Biotechniaues 27(4): 
778-788, 1999. Furthermore, variations of the above described assay will be apparent to 
25 those skilled in the art, such as, for example, a competitive ELISA. 

Alternatively, the presence of antibodies against the cancer-associate protein, or 
alternatively an oarian cancer-associated protein or an immunogenic fragment thereof, is 
detected using a radioimmunoassay .(RIA). The basic principle of the assay is the use of 

30 a radiolabelled antibody or antigen to detect antibody antigen interactions. For example, 
an antibody that specifically binds to an ovarian cancer-associated protein can be bound 
to a solid support and a biological sample brought into direct contact with said antibody. 
To detect the bound antigen, an isolated and/or recombinant form of the antigen is 
radiolabelled is brought into contact with the same antibody. Following washing the 

35 amount of bound radioactivity is detected. As any antigen in the biological sample inhibits 
binding of the radiolabelled antigen the amount of radioactivity detected is inversely 
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proportional to the amount of antigen in the sample. Such an assay may be quantitated 
by using a standard curve using increasing known concentrations of the isolated antigen. 

As will be apparent to the skilled artisan, such an assay may be modified to use any 
reporter molecule, such as, for example, an enzyme or a fluorescent molecule, in place 
of a radioactive label. 

Western blotting is also useful for detecting an ovarian cancer-associated protein or an 
immunogenic fragment thereof. In such an assay protein from a biological sample is 
separated using sodium dodecyl sulphate (SDS) polyacrylamide gel electrophoresis 
(SDS-PAGE) using techniques well known in the art and described in, for example, 
Scopes (Jm Protein Purification: Principles and Practice. Third Edition, Springer Verlag, 
1994). Separated proteins are then transferred to a solid support, such as, for example, 
a membrane or more specifically PVDF membrane, using methods well known in the art, 
for example, etectrotransfer. This membrane may then be blocked and probed with a 
labelled antibody or ligand.that specifically binds an ovarian cancer-associated protein. 
Alternatively, a labelled secondary, or even tertiary, antibody or ligand can be used to 
detect the binding of a specific primary antibody. 

High-throughput methods for detecting the presence or absence of antibodies, or 
alternatively ovarian cancer-associated protein or an immunogenic fragment thereof are 
particularly preferred. 

In one embodiment, MALDI-TOF is used for the rapid identification of a protein. 
Accordingly, there is no need to detect the proteins of interest using an antibody or ligand 
that specifically binds to the protein of interest. Rather, proteins from a biological sample 
are separated using gel electrophoresis using methods well known in the art and those 
proteins at approximately the correct molecular weight and/or isoelectric point are 
analysed using MALDI-TOF to determine the presence or absence of a protein of 
interest. 

Alternatively, MALDI or ESI or a combination of approaches is used to determine the 
concentration of a particular protein in a biological sample, such as, for example sputum. 
Such proteins are preferably well characterised previously with regard to parameters 
such as molecular weight and isoelectric point. 
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Biosensor devices generally employ an electrode surface in combination with current or 
impedance measuring elements to be integrated into a device, in combination with the 
assay substrate (such as that described in U.S. Patent No. 5,567,301). An antibody or 
ligand that specifically binds to a protein of interest is preferably incorporated onto the 
5 surface of a biosensor device and a biological sample isolated from a patient (for 
example sputum that has been solubilised using the methods described herein) 
contacted to said device. A change in the detected current or impedance by the 
biosensor device indicates protein binding to said antibody or ligand. Some forms of 
biosensors known in the art also rely on surface plasmon resonance to detect protein 
10 interactions, whereby a change in the surface plasmon resonance surface of reflection is 
indicative of a protein binding to a ligand or antibody (U.S. Patent No. 5,485,277 and 
5,492,840). 

Biosensors are of particular use in high throughput analysis due to the ease of adapting 
15 such systems to micro- or nano-scales. Furthermore, such systems are conveniently 
adapted to incorporate several detection reagents, allowing for multiplexing of diagnostic 
reagents in a single biosensor unit. This permits the simultaneous detection of several 
epitopes in a small amount of body fluids. 

20 Evanescent biosensors are also preferred as they do not require the pretreatment of a 
biological sample prior to detection of a protein of interest. An evanescent biosensor 
generally relies upon light of a predetermined wavelength interacting with a fluorescent 
molecule, such as for example, a fluorescent antibody attached near the probe's surface, 
to emit fluorescence at a different wavelength upon binding of the diagnostic protein to 

25 the antibody or ligand. 

To produce protein chips, the proteins, peptides, polypeptides, antibodies or ligands that 
are able to bind specific antibodies or proteins of interest are bound to a solid support 
such as for example, glass, polycarbonate, polyletrafluoroethylene, polystyrene, silicon 

30 oxide, metal or silicon nitride. This immobilization is either direct (e.g. by covalent 
linkage, such as, for example, Schiffs base formation, disulfide linkage, or amide or urea 
bond formation) or indirect. Methods of generating a protein chip are known in the art and 
are described in for example U.S. Patent Application No. 20020136821, 20020192654, 
20020102617 and U.S. Patent No. 6,391,625. In order to bind a protein to a solid support 

35 it is often necessary to treat the solid support so as to create chemically reactive groups 
on the surface, such as, for example, with an aldehyde-containing silane reagent. 
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Alternatively, an antibody or ligand may be raptured on a microfabricated polyacrylamide 
gel pad and accelerated into the gel using microelectrophoresis as described in, Arenkov 
et al Anal. Biochem. 278: 1 23-1 31 , 2000. 

5 A protein chip is preferably generated such that several proteins, ligands or antibodies 
are arrayed on said chip. This format permits the simultaneous screening for the 
presence of several proteins in a sample. 

Alternatively, a protein chip may comprise only one protein, ligand or antibody, and be 
10 used to screen one or more patient samples for the presence of one polypeptide of 
interest. Such a chip may also be used to simultaneously screen an array of patient 
samples for a polypeptide of interest 

Preferably, a sample to be analysed using a protein chip is attached to a reporter 
15 molecule, such as, for example, a fluorescent molecule, a radioactive molecule, an 
enzyme, or an antibody that is detectable using methods well known in the art. 
Accordingly, by contacting a protein chip with a labelled sample and subsequent washing 
to remove any unbound proteins the presence of a bound protein is detected using 
methods well known in the art, such as, for example using a DNA microarray reader. 

20 

Alternatively, biomolecular interaction analysis-mass spectrometry (BIA-MS) is used to 
rapidly detect and characterise a protein present in complex biological samples at the 
low- to sub-fmole level (Nelson et al Electrophoresis 21: 1155-1163, 2000).' One 
technique useful in the analysis of a protein chip is surface enhanced laser 
25 desorption/ionization-time of flight-mass spectrometry (SELDI-TOF-MS) technology to 
characterise a protein bound to the protein chip. Alternatively, the protein chip is 
analysed using ESI as described in U.S. Patent Application 20020139751. 

As will be apparent to the skilled artisan, protein chips are particularly amenable to 
30 multiplexing of detection reagents. Accordingly, several antibodies or ligands each able 
to specifically bind a different peptide or protein may be bound to different regions of said 
protein chip. Analysis of a biological sample using said chip then permits the detecting of 
multiple proteins of interest, or multiple B cell epitopes of the ovarian cancer-associated 
protein. Multiplexing of diagnostic and prognostic mariners is particularly contemplated in 
35 the present invention. 
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In a further embodiment, the samples are analysed using ICAT, essentially as described 
in US Patent Application No. 20020076739. This system relies upon the labelling of a 
protein sample from one source (i.e. a healthy individual) with a reagent and the labelling 
of a protein sample from another source (i.e. a tuberculosis patient) with a second 
5 reagent that is chemically identical to the first reagent, but differs in mass due to isotope 
composition. It is preferable that the first and second reagents also comprise a biotin 
molecule. Equal concentrations of the two samples are then mixed, and peptides 
recovered by avidin affinity chromatography. Samples are then analysed using mass 
spectrometry. Any difference in peak heights between the heavy and light peptide ions 
10 directly correlates with a difference in protein abundance in a biological sample. The 
identity of such proteins may then be determined using a method well known in the art, 
such as, for example MALDI-TOF, or ESI. 

As will be apparent to those skilled in the art a diagnostic or prognostic assay described 
15 herein may be a multiplexed assay. As used herein the term "multiplex", shall be 
understood not only to mean the detection of two or more diagnostic or prognostic 
markers in a single sample simultaneously, but also to encompass consecutive detection 
of two or more diagnostic or prognostic markers in a single sample, simultaneous 
detection of two or more diagnostic or prognostic markers in distinct but matched 
20 samples, and consecutive detection of two or more diagnostic or prognostic markers in 
distinct but matched samples. As used herein the term "matched samples" shall be 
understood to mean two or more samples derived from the same initial biological sample, 
or two or more biological samples isolated at the same point in time. 

25 Accordingly, a multiplexed assay may comprise an assay that detects several antibodies 
and/or epitopes in the same reaction and simultaneously, or alternatively, it may detect 
other one or more antigens/antibodies in addition to one or more antibodies and/or 
epitopes. As will be apparent to the skilled artisan, if such an assay is antibody or ligand 
based, both of these antibodies must function under the same conditions. 

30 

Diagnostic assay kits 

A further aspect of the present invention provides a kit for detecting M. tuberculosis 
infection in a biological sample. In one embodiment, the kit comprises: 
(I) one or more isolated antibodies that bind to an ovarian cancer-associated protein 
35 or an immunogenic fragment or epitope thereof; and 

(ii) means for detecting the formation of an antigen-antibody complex. 
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In an alternative embodiment, the kit comprises: 
(i) an isolated or recombinant ovarian cancer-associated protein or an immunogenic 
fragment or epitope thereof; and 
5 (ii) means for detecting the formation of an antigen-antibody complex. 

Optionally, the kit further comprises means for the detection of the binding of an 
antibody, fragment thereof or a ligand to an ovarian cancer-associated protein. Such 
means include a reporter molecule such as, for example, an enzyme (such as 
10 horseradish peroxidase or alkaline phosphatase), a substrate, a cofactor, an inhibitor, a 
dye, a radionucleotide, a luminescent group, a fluorescent group, biotin or a colloidal 
particle, such as colloidal gold or selenium. Preferably such a reporter molecule is 
directly linked to the antibody or ligand. 

15 In yet another embodiment, a kit may additionally comprise a reference sample. Such a 
reference sample. 

In another embodiment, a reference sample comprises a peptide that is detected by an 
antibody or a ligand. Preferably, the peptide is of known concentration. Such a peptide 
20 is of particular use as a standard. Accordingly various known concentrations of such a 
peptide may be detected using a prognostic or diagnostic assay described herein. 

In yet another embodiment a kit comprises means for protein isolation (Scopes (In: 
Protein Purification: Principles and Practice, Third Edition, Springer Verlag, 1994). 

25 

Bioinformatics 

The ability to identify genes that are over or under expressed in ovarian cancer can 
additionally provide high-resolution, high-sensitivity datasets which are used in the areas 
of diagnostics, therapeutics, drug development, pharmacogenetics, protein structure, 

30 biosensor development, and other related areas. For example, the expression profiles 
are used in diagnostic or prognostic evaluation of patients with ovarian cancer. Or as 
another example, subcellular toxicological information are generated to better direct drug 
structure and activity correlation (see Anderson, Pharmaceutical Proteomics: Targets, 
Mechanism, and Function, paper presented at the IBC Proteomics conference, 

35 Coronado, CA (June 11-12, 1998)). Subcellular toxicological information can also be 
utilized in a biological sensor device to predict the likely toxicological effect of chemical 
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exposures and likely tolerable exposure thresholds (see U.S. Patent No. 5,811,231).- 
Similar advantages accrue from datasets relevant to other biomolecules and bioactive 
agents (e.g., nucleic acids, saccharides, lipids, drugs, and the like). 

5 Thus, in another embodiment, the present invention provides a database that includes at 
least one set of assay data. The data contained in the database is acquired, e.g., using 
array analysis either singly or in a library format. The database are in substantially any 
form in which data are maintained and transmitted, but is preferably an electronic 
database. The electronic database of the invention are maintained on any electronic 
10 device allowing for the storage of and access to the database, such as a personal 
computer, but is preferably distributed on a wide area network, such as the World Wide 
Web, 

The focus of the present section on databases that include peptide sequence data is for 
15 clarity of illustration only. It will be apparent to those of skill in the art that similar 
databases are assembled for any assay data acquired using an assay of the invention. 

The compositions and methods for identifying and/or quantitating the relative and/or 
absolute abundance of a variety of molecular and macromolecular species from a 

20 biological sample undergoing ovarian cancer, i.e., the identification of ovarian cancer- 
associated sequences described herein, provide an abundance of information, which are 
correlated with pathological conditions, predisposition to disease, drug testing, 
therapeutic monitoring, gene-disease causal linkages, identification of correlates of 
immunity and physiological status, among others. Although the data generated from the 

25 assays of the invention is suited for manual review and analysis, in a preferred 
embodiment, prior data processing using high-speed computers is utilized. 

An array of methods for indexing and retrieving biomolecular information is known in the 
art. For example, U.S. Patents 6,023,659 and 5,966,712 disclose a relational database 

30 system for storing biomolecular sequence information in a manner that allows sequences 
to be catalogued and searched according to one or more protein function hierarchies. 
U.S. Patent 5,953,727 discloses a relational database having sequence records 
containing information in a format that allows a collection of partial-length DNA 
sequences to be catalogued and searched according to association with one or more 

35 sequencing projects for obtaining full-length sequences from the collection of partial 
length sequences. U.S. Patent 5,706,498 discloses a gene database retrieval system for 
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making a retrieval of . a gene sequence similar to a sequence data item in a gene 
database based on the degree of similarity between a key sequence and a target 
sequence. U.S. Patent 5,538,897 discloses a method using mass spectroscopy 
fragmentation patterns of peptides to identify amino acid sequences in computer 
5 databases by comparison of predicted mass spectra with experimentally-derived mass 
spectra using a closeness-of-fit measure. U.S. Patent 5,926,818 discloses a multi- 
dimensional database comprising a functionality for multi-dimensional data analysis 
described as on-line analytical processing (OLAP), which entails the consolidation of 
projected and actual data according to more than one consolidation path or dimension. 
10 U.S. Patent 5,295,261 reports a hybrid database structure in which the fields of each 
database record are divided into two classes, navigational and informational data, with 
navigational fields stored in a hierarchical topological map which are viewed as a tree 
structure or as the merger of two or more such tree structures. 

15 See also Mount et al, Bioinformatics (2001); Biological Sequence Analysis: Probabilistic 
Models of Proteins and Nucleic Acids (Durbin et a/., eds., 1999); Bioiraformatics: A 
Practical Guide to the Analysis of Genes and Proteins (Baxevanis & Oeullette eds., 
1998)); Rashidi & Buehler, Bioinformatics: Basic Applications in Biological Science and 
Medicine (1999); Introduction to Computational Molecular Biology (Setubal ef aL, eds 

20 1997); Bioinformatics: Methods, and Protocols (Misener & Krawetz, eds, 2000); 
Bioinformatics: Sequence, Structure, and Databanks: A Practical Approach (Higgins & 
Taylor, eds., 2000); Brown, Bioinfor7natics: A Biologists Guide to Biocomputing and the 
Internet (2001); Han & Kamber, Data Mining: Concepts and Techniques (2000); and 
Waterman, Introduction to Computational Biology: Maps, Sequences, and Genomes 

25 (1995). 

The present invention provides a computer database comprising a computer and 
software for storing in computer-retrievable form assay data records cross-tabulated, 
e.g., with data specifying the source of the target-containing sample from which each 
30 sequence specificity record was obtained. 

In an exemplary embodiment, at least one of the sources of target-containing sample is 
from a control tissue sample known to be free of pathological disorders. In a variation, at 
least one of the sources is a known pathological tissue specimen, e.g., a neoplastic 
35 . lesion or another tissue specimen to be analyzed for prostate cancer. In another 
variation, the assay records cross-tabulate one or more of the following parameters for 
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each target species in a sample:; (1) a unique identification code, which can include, e.g., 
a target molecular structure ahd/or characteristic separation coordinate (e.g., 
electrophoretic coordinates); (2) sample source; and (3) absolute and/or relative quantity 
of the target species present in the sample. 

The invention also provides for the storage and retrieval of a collection of target data in a 
computer data storage apparatus, which can include magnetic disks, optical disks, 
magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM, magnetic 
bubble memory devices, and other data storage devices, including CPU registers and 
on-CPU data storage arrays. Typically, the target data records are stored as a bit pattern 
in an array of magnetic domains on a magnetizable medium or as an array of charge 
states or transistor gate states, such as an array of cells in a DRAM device (e.g., each 
cell comprised of a transistor and a charge storage area, which are on the transistor). In 
one embodiment, the invention provides such storage devices, and computer systems 
built therewith, comprising a bit pattern encoding a protein expression fingerprint record 
comprising unique identifiers for at least 10 target data records cross-tabulated with 
target source. 

When the target is a peptide or nucleic acid, the invention preferably provides a method 
for identifying related peptide or nucleic acid sequences, comprising performing a 
computerised comparison between a peptide or nucleic acid sequence assay record 
stored in or retrieved from a computer storage device or database and at least one other 
sequence. The comparison can include a sequence analysis or comparison algorithm or 
computer program embodiment thereof (e.g., BLAST, FASTA, TFASTA, GAP, BESTFIT 
- see above) and/or the comparison are of the relative amount of a peptide or nucleic 
acid sequence in a pool of sequences determined from a polypeptide or nucleic acid 
sample of a specimen. 

The invention also preferably provides a magnetic disk, such as an IBM-compatible 
(DOS, Windows, Windows95/.98/2000, Windows NT, OS/2) or other format (e.g., Linux, 
SunOS, Solaris, AIX, SCO Unix, VMS, MV, Macintosh, etc.) floppy diskette or hard 
(fixed, Winchester) disk drive, comprising a bit pattern encoding data from an assay of 
the invention in a file format suitable for retrieval and processing in a computerized 
sequence analysis, comparison, or relative quantitation method. 
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The invention also provides a network, comprising a plurality of computing devices linked 
via a data link, such as an Ethernet cable (coax or lOBaseT), telephone line, ISDN line, 
wireless network, optical fiber, or other suitable signal transmission medium, whereby at 
least one network device (e.g., computer, disk array, etc.) comprises a pattern of 
5 magnetic domains (e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM 
cells) composing a bit pattern encoding data acquired from an assay of the invention. 

The invention also . provides a method for transmitting assay data that includes 
generating an electronic signal on an electronic communications device, such as a 
10 modem, ISDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the 
signal includes (in native or encrypted format) a bit pattern encoding data from an assay 
or a database comprising a plurality of assay results obtained by the method of the 
invention. 

15 In a preferred embodiment, the invention provides a computer system for comparing a 
query target to a database containing an array of data structures, such as an assay result 
obtained by the method of the invention, and ranking database targets based on the 
degree of identity and gap weight to the target data. A central processor is preferably 
initialized to load and execute the computer program for alignment and/or comparison of 

20 the assay results. Data for a query target is entered into the central processor via an I/O 
device. Execution of the computer program results in the central processor retrieving the 
assay data from the data file, which comprises a binary description of an assay result. 

The target data or record and the computer program are transferred to secondary 
25 memory, which is typically random access memory (e.g., DRAM, SRAM, SGRAM, or 
SDRAM). Targets are ranked according to the degree of correspondence between a 
selected assay characteristic (e.g., binding to a selected affinity moiety) and the same 
characteristic of the query target and results are output via an I/O device. For example, a 
central processor are a conventional computer (e.g., Intel Pentium, PowerPC, Alpha, 
30 PA-8000, SPARC, MIPS 4400, MIPS 10000, VAX, etc.); a program are a commercial or 
public domain molecular biology software package (e.g., UWGCG Sequence Analysis 
Software, Darwin); a data file are an optical or magnetic disk, a data server, a memory 
device (e.g., DRAM, SRAM, SGRAM, SDRAM, EPROM, bubble memory, flash memory, 
etc.); an I/O device are a terminal comprising a video display and a keyboard, a modem, 
35 an ISDN terminal adapter, an Ethernet port, a punched card reader, a magnetic strip 
reader, or other suitable I/O device. 
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The invention also preferably provides the use of a computer system, such as that 
described above, which comprises: (1) a computer (2) a stored bit pattern encoding a 
collection of peptide sequence specificity records obtained by the methods of the 
5 invention, which are stored in the computer; (3) a comparison target, such as a query 
target; and (4) a program for alignment and comparison, typically with rank-ordering of 
comparison results on the basis of computed similarity values. 

Transgenic Animals Expressing Ovarian Cancer-associated proteins and "Knock-Out" 
10 Animals 

The present invention also contemplates transgenic animals which are transgenic by 
virtue- -of comprising a polynucleotide of the invention, Le. animals transformed with a 
cancer-associated gene of the invention. Suitable animals are generally from the phylum 
chordata. Chordates includes vertebrate groups such as mammals, birds, reptiles and 
15 amphibians. Particular examples of mammals include non-human primates, cats, dogs, 
ungulates such as cows, goats, pigs, sheep and horses and rodents such as mice, rats, 
gerbils and hamsters. Transgenic animals within the meaning of the present invention 
are non-human animals and the production of transgenic humans is specifically 
excluded. 

20 

Techniques for producing transgenic animals are well known in the art A useful general 
textbook on this subject is Houdebine, Transgenic animals - Generation and Use 
(Harwood Academic, 1997) - an extensive review of the techniques used to generate 
transgenic animals from fish to mice and cows. 

25 

Advances in technologies for embryo micromanipulation now permit introduction of 
heterologous DNA into, for example, fertilized mammalian ova. For instance, totipotent or 
pluripotent stem cells are transformed by microinjection, calcium phosphate mediated 
precipitation, liposome fusion, retroviral infection or other means, the transformed cells 

30 are then introduced into, the embryo, and the embryo then develops into a transgenic 
animal. In a highly preferred method, developing embryos are infected with a retrovirus 
containing the desired DNA, and transgenic animals produced from the infected embryo. 
In a most preferred method, however, the appropriate DNAs are coinjected into the 
pronucleus or cytoplasm of embryos, preferably at the single cell stage, and the embryos 

35 allowed to develop into mature transgenic animals. Those techniques as well known. See 
reviews of standard laboratory procedures for microinjection of heterologous DNAs into 
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mammalian fertilized ova, including Hogan et a/., Manipulating the Mouse Embryo, (Cold 
Spring Harbor Press 1986); Krimpenfort et a/., Bio/Technology 9:844 (1991); Palmiter ef 
a/., Cell, 41: 343 (1985); Kraemer et a/., Genetic manipulation of the Mammalian Embryo, 
(Cold Spring Harbor Laboratory Press 1985); Hammer et a/., Nature, 315: 680 (1985); 
5 Wagner et a/., U.S. Pat No. 5,175,385; Krimpenfort et al. t U.S. Pat No. 5,175,384, the 
respective contents of which are incorporated herein by reference 

Another method used to produce a transgenic animal involves microinjecting a nucieic 
acid into pro-nuclear stage eggs by standard methods. Injected eggs are then cultured 
10 before transfer into the oviducts of pseudopregnant recipients. 

Transgenic animals may also be produced by nuclear transfer technology as described in 
Schnieke, A.E. et a/., 1997, Science, 278: 2130 and Cibelli, J.B. et a/., 1998, Science, 
280: 1256. Using this method, fibroblasts from donor animals are stably transfected with 
15 a plasmid incorporating the coding sequences for a binding domain or binding partner of 
interest under the control of regulatory. Stable transfectants are then fused to enucleated 
oocytes, cultured and transferred into female recipients. 

Analysis of animals which may contain transgenic sequences would typically be 
20 performed by either PCR or Southern blot analysis following standard methods. 

By way of a specific example for the construction of transgenic mammals, such as cows, 
nucleotide constructs comprising a sequence encoding a binding domain fused to GFP 
are microinjected using, for example, the technique described in U.S. Pat No. 4,873,191, 
25 into oocytes which are obtained from ovaries freshly removed from the mammal. The 
oocytes are aspirated from the follicles and allowed to settle before fertilization with 
thawed frozen sperm capacitated with heparin and prefractionated by Percoll gradient to 
isolate the motile fraction. 

30 The fertilized oocytes are centrifuged, for example, for eight minutes at 15,000 g to 
visualize the pronuclei for injection and then cultured from the zygote to morula or 
blastocyst stage in oviduct tissue-conditioned medium. This medium is prepared by using 
luminal tissues scraped from oviducts and diluted in culture medium. The zygotes must 
be placed in the culture medium within two hours following microinjection. 

35 
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Oestrous is then synchronized in the intended recipient mammals, such as cattle, by 
administering coprostanol. Oestrous is produced within two days and the embryos are 
transferred to the recipients 5-7 days after estrous. Successful transfer are evaluated in 
the offspring by Southern blot 

5 

Alternatively, the desired constructs are introduced into embryonic stem cells (ES cells) 
and the cells cultured to ensure modification by the transgene. The modified cells are 
then injected into the blastula embryonic stage and the blastulas replaced into 
pseudopregnant hosts. The resulting offspring are chimeric with respect to the ES and 
10 host cells, and nonchimeric strains which exclusively comprise the ES progeny are 
obtained using conventional cross-breeding. This technique is described, for example, in 
WO91/10741. 

In another embodiment, transgenic animals of the present invention are transgenic 
15 "knock-out" animals where a specific gene corresponding to a polynucleotide referred to 
in Tables 1-3 has been rendered non-functional by homologous recombination. The 
generation of K knock-ouf animals is similar to the production of other transgenic animals 
except that the polynucleotide constructs are designed to integrate into the endogenous 
genes and disrupt the function of the endogenous sequences. The generation of "knock- 
20 out" animals is known in the art, including the design of suitable constructs that will 
recombine at the appropriate site in the genome. 

In one embodiment, the heterologous sequence which it is desired to recombine into the 
genome of a target animal comprises a functional sequence but under the control of an 
25 inducible promoter so that expression of the gene are regulated by administration of an 
endogenous molecule. This are advantageous where disruption of the gene is 
embryonic-lethal. 

"Knock-ouf animals are used as animal models for the study of gene function. 

30 

Therapeutic peptides 

In accordance with this embodiment, ovarian cancer-associated proteins of the present 
invention are administered therapeutically to patients for a time and under conditions 
sufficient to ameliorate the growth of a tumor in the subject or to prevent tumor 
35 recurrence. 
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It is preferred to use peptides that do not consisting solely of naturally-occurring amino 
acids but which have been modified, for example to reduce immunogenicity, to increase 
circulatory half-life in the body of the patient, to enhance bioavailability and/or to enhance 
efficacy and/or specificity. 

A number of approaches have been used to modify peptides for therapeutic application. 
One approach is to link the peptides or proteins to a variety of polymers, such as 
polyethylene glycol (PEG) and polypropylene glycol (PPG) - see for example U.S. Patent 
Nos. 5,091,176, 5,214,131 and US 5,264,209. 

Replacement of naturally-occurring amino acids with a variety of uncoded or modified 
amino acids such as D-amino acids and N-methyl amino acids may also be used to 
modify peptides 

Another approach is to use bifunctiona! crosslinkers, such as N-succinimidyl 3-(2 
pyridyldithio) propionate, succinimkJyl 6-[3-(2 pyridyldithio) propionamido] hexanoate, and 
sulfosuccinimidyl 6-[3-(2 pyridyldithio) propionamido]hexanoate (see US Patent 
5,580,853). 

It are desirable to use derivatives of the ovarian cancer-associated proteins of the 
invention which are conformationally constrained. Conformational constraint refers to the 
stability and preferred conformation of the three-dimensional shape assumed by a 
peptide. Conformational constraints include local constraints, involving restricting the 
conformational mobility of a single residue , in a peptide; regional constraints, involving 
restricting the conformational mobility of a group of residues, which residues may form 
some secondary structural unit; and global constraints, involving the entire peptide 
structure. 

The active conformation of the peptide are stabilized by a covalent modification, such as 
cyclization or by incorporation of gamma-lactam or other types of bridges. For example, 
side chains are cycfized to the backbone so as create a L-gamma-lactam moiety on each 
side of the interaction site. See, generally, Hruby et al., "Applications of Synthetic 
Peptides," in Synthetic Peptides: A User's Guide: 259-345 (W. H. Freeman & Co. 1992). 
Cyclization also are achieved, for example, by formation of cystine bridges, coupling of 
amino and carboxy terminal groups of respective temiinal amino acids, or coupling of the 
amino group of a Lys residue or a related homolog with a carboxy group of Asp, Glu or a 
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related homolog. Coupling of the .alpha-amino group of a polypeptide with the epsilon- 
amino group of a lysine residue, using iodoacetic anhydride, are also undertaken. See 
Wood and Wetzel, 1992, Infl J. Peptide Protein Res. 39: 533-39. 

5 Another approach described in US 5,891,418 is to include a metal-ion complexing 
backbone in the peptide structure. Typically, the preferred metal-peptide backbone is 
based on the requisite number of particular coordinating groups required by the 
coordination sphere of a given complexing metal ion. In general, most of the metal ions 
that may prove useful have a coordination number of four to six. The nature of the 

10 coordinating groups in the peptide chain includes nitrogen atoms with amine, amide, 
imidazole, or guanidino functionalities; sulfur atoms of thiols or disulfides; and oxygen 
atoms of hydroxy, phenolic, carbonyl, or carboxyl functionalities. In addition, the peptide 
chain or individual amino acids are chemically altered to include a coordinating group, 
such as for example oxime, hydrazino, sulfhydryl, phosphate, cyano, pyridino, piperidino, 

15 or morpholino. The peptide construct are either linear or cyclic, however a linear 
construct is typically preferred. One example of a small linear peptide is Gly-Gly-G!y-Gly 
which has four nitrogens (an M* complexation system) in the back bone that can complex 
to a metal ion with a coordination number of four. 

20 A further technique for improving the properties of therapeutic peptides is to use 
non-peptide peptidomimetics. A wide variety of useful techniques are used to elucidating 
the precise structure of a peptide. These techniques include amino acid sequencing, x- 
ray crystallography, mass spectroscopy, nuclear magnetic resonance spectroscopy, 
computer-assisted molecular modeling, peptide mapping, and combinations thereof. 

25 Structural analysis of a peptide generally provides a large body of data which comprise 
the amino acid sequence of the peptide as well as the three-dimensional positioning of its 
atomic components. From this information, non-peptide peptidomimetics are designed 
that have the required chemical functionalities for therapeutic activity but are more stable, 
for example less susceptible to biological degradation. An example of this approach is 

30 provided in US 5,81 1 ,512. . 

Techniques for chemically synthesising therapeutic peptides of the invention are 
described in the above references and also reviewed by Borgia and Fields, 2000, 
TibTech 18: 243-251 and described in detail in the references contained therein. 

35 

Assays for therapeutic compounds 
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The ovarian cancer proteins, nucleic acids, and antibodies as described herein are used 
in drug screening assays to identify candidate compounds for use in treating ovarian 
cancer. The ovarian cancer-associated proteins, antibodies, nucleic acids, modified 
proteins and cells containing ovarian cancer sequences are used in drug screening 
assays or by evaluating the effect of drug candidates on a "gene expression profile" or 
expression profile of polypeptides. In a preferred embodiment, the expression profiles are 
used, preferably in conjunction with high throughput screening techniques to allow 
monitoring for expression profile genes after treatment with a candidate agent (e.g., 
Zlokarnik, et al, 1998, Science 279: 84-88); Heid, 1996, Genome Res 6: 986-94). 

In a preferred embodiment, the ovarian cancer-associated proteins, antibodies, nucleic 
acids, modified proteins and cells containing the native or modified ovarian cancer- 
associated proteins are used in screening assays. That is, the present invention provides 
methods for screening for compounds/agents which modulate the ovarian cancer 
phenotype or an identified physiological function of a ovarian cancer-associated protein. 
As above, this are done on an individual gene level or by evaluating the effect of drug 
candidates on a "gene expression profile". In a preferred embodiment, the expression 
profiles are used, preferably in conjunction with high throughput screening techniques to 
allow monitoring for expression profile genes after treatment with a candidate agent, see 
Zlokarnik, supra. 

Having identified the differentially expressed genes herein, a variety of assays are 
executed. In a preferred embodiment, assays are run on an individual gene or protein 
level. That is, having identified a particular gene as up regulated in ovarian cancer, test 
compounds are screened for the ability to modulate gene expression or for binding to the 
ovarian cancer-associated protein. "Modulation" thus includes both an increase and a 
decrease in gene expression. The preferred amount of modulation will depend on the 
original change of the gene expression in normal versus tissue undergoing ovarian 
cancer, with changes of at least 10%, preferably 50%, more preferably 100-300%, and in 
some embodiments 300-1000% or greater. Thus, if a gene exhibits a 4-fold increase in 
ovarian cancer tissue compared to normal tissue, a decrease of about four-fold is often 
desired; similarly, a 10-fold decrease in ovarian cancer tissue compared to normal tissue 
often provides a target value of a 10-fold increase in expression to be induced by the test 
compound. 
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The amount of gene expression are monitored using nucleic acid probes and the 
quantification of gene expression levels, or, alternatively, the gene product itself are 
monitored, e.g., through the use of antibodies to the ovarian cancer-associated protein 
and standard immunoassays. Proteomics and separation techniques may also allow 
quantification of expression. . 

In a preferred embodiment, gene expression or protein monitoring of a number of 
entities, i.e., an expression profile; is monitored simultaneously. Such profiles will 
typically involve a plurality of those entities described herein. 

In this embodiment, the ovarian cancer nucleic acid probes are attached to biochips as 
outlined herein for the detection and quantification of ovarian cancer sequences in a 
particular cell. Alternatively, PCR are used. Thus, a series are used with dispensed 
primers in desired wells. A PCR reaction can then be performed and analyzed for each 
well. 

Expression monitoring are performed to identify compounds that modify the expression 
of one or more ovarian cancer-associated sequences, e.g., a polynucleotide sequence 
set out in Tables 1-3. In a preferred embodiment, a test modulator is added to the cells 
prior to analysis. Moreover, screens are also provided to identify agents that modulate 
ovarian cancer, modulate ovarian cancer-associated proteins, bind to a ovarian cancer- 
associated protein, or interfere with the binding of a ovarian cancer-associated protein 
and an antibody or other binding partner. 

The term "test compound- or "drug candidate" or "modulator or grammatical equivalents 
as used herein describes any molecule, e.g., protein, oligopeptide, small organic 
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or 
indirectly alter the ovarian cancer phenotype or the expression of a ovarian cancer 
sequence, e.g., a nucleic acid or protein sequence. In preferred embodiments, 
modulators alter expression profiles, or expression profile nucleic acids or proteins 
provided herein. In one embodiment, the modulator suppresses a ovarian cancer 
phenotype, e.g. to a normal tissue fingerprint In another embodiment, a modulator 
induced a ovarian cancer phenotype. Generally, a plurality of assay mixtures are run in 
parallel with different agent concentrations to obtain a differential response to the various 
concentrations. Typically, one of these concentrations serves as a negative control, i.e., 
at zero concentration or below the level of detection. 
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. Drug candidates encompass numerous chemical classes, though typically they are 
organic molecules, preferably small organic compounds having a molecular weight of 
more than 100 and less than about 2,500 daltons. Preferred small molecules are less 
5 than 2000, or less than 1500 or less than 1000 or less than 500 Daltons. Candidate 
agents comprise functional groups necessary for structural interaction with proteins, 
particularly hydrogen bonding, and typically include at least an amine, carbonyl, hydroxyl 
or carboxyl group, preferably at least two of the functional chemical groups. The 
candidate agents often comprise cyclical carbon or heterocyclic structures and/or 
10 aromatic or polyaromatic structures substituted with one or more of the above functional 
groups. Candidate agents are also found among biomolecules including peptides, 
saccharides, fatty acids, steroids, purines, pyrimidines, derivatives, structural analogs or 
combinations thereof. Particularly preferred are peptides. 

15 In one aspect a modulator will neutralize the effect of a ovarian cancer-associated 
. protein. By "neutralize" is meant that activity of a protein is inhibited or blocked and the 
consequent effect on the cell. 

In certain embodiments, combinatorial libraries of potential modulators will be screened 
20 for an ability to bind to a ovarian cancer polypeptide or to modulate activity. 
Conventionally, new chemical entities with useful properties are generated by identifying 
a chemical compound (called a "lead compound") with some desirable property or 
activity, e.g., inhibiting activity, creating variants of the lead compound, and evaluating 
the property and activity of those variant compounds. Often, high throughput screening 
25 (HTS) methods are employed for such an analysis. 

In one preferred embodiment, high throughput screening methods involve providing a 
library containing a large number of potential therapeutic compounds (candidate 
compounds). Such "combinatorial chemical libraries" are then screened in one or more 
30 assays to identify those library members (particular chemical species or subclasses) that 
display a desired characteristic activity. The compounds thus identified can serve as 
conventional "lead compounds" or can themselves be used as potential or actual 
therapeutics. 

35 A combinatorial chemical library is a collection of diverse chemical compounds generated 
by either chemical synthesis or biological synthesis by combining a number of chemical 
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"building blocks" such as reagents. For example, a linear combinatorial chemical library, 
such as a polypeptide (e.g., mutein) library, is formed by combining a set of chemical 
building blocks called amino acids in every possible way for a given compound length 
(i.e., the number of amino acids in a polypeptide compound). Millions of chemical 
compounds are synthesized through such combinatorial mixing of chemical building 
blocks (Gallop eta!., 1994, J. Med. Chem. 37(9): 1233-1 251). 

Preparation and screening of combinatorial chemical libraries is well known to those of 
skill in the art. Such combinatorial chemical libraries include, but are not limited to, 
peptide libraries, peptoids, encoded peptides, random bio-oligomers, nonpeptidal 
peptidomimetics, analogous organic syntheses of small compound libraries, nucleic acid 
libraries, peptide nucleic acid libraries, antibody libraries, carbohydrate libraries and small 
organic molecule libraries. 

The assays to identify modulators are amenable to high throughput screening. Preferred 
assays thus detect enhancement or inhibition of ovarian cancer gene transcription, 
inhibition or enhancement of polypeptide expression, and inhibition or enhancement of 
polypeptide activity. 

High throughput assays for the presence, absence, quantification, or other properties of 
particular nucleic acids or protein products are well known to those of skill in the art. 
Similarly, binding assays and reporter gene assays are similarly well known. Thus, e.g., 
U.S. Patent No. 5,559,410 discloses high throughput screening methods for proteins, 
U.S. Patent No. 5,585,639 discloses high throughput screening methods for nucleic acid 
binding (i.e., in arrays), while U.S. Patent Nos. 5,576,220 and 5,541,061 disclose high 
throughput methods of screening for ligand/antibody binding. 

In addition, high throughput screening systems are commercially available (see, e.g., 
Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman 
Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, efc.;. These 
systems typically automate entire procedures, including all samlsle and reagent pipetting, 
liquid dispensing, timed incubations, and final readings of the microplate in detectors) 
appropriate for the assay. These configurable systems provide high throughput and rapid 
start up as well as a high degree of flexibility and customization. The manufacturers of 
such systems provide detailed protocols for various high throughput systems. Thus, e.g., 
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Zymark Corp. provides technical bulletins describing screening systems for detecting the 
modulation of gene transcription, ligand binding, and the like. 

In one embodiment, modulators are proteins, often naturally occurring proteins or 
fragments of naturally occurring proteins. Thus, e.g., cellular extracts containing proteins, 
or random or directed digests of proteinaceous cellular extracts, are used. In this way 
libraries of proteins are made for screening in the methods of the invention. Particularly 
preferred in this embodiment are libraries of bacterial, fungal, viral, and mammalian 
proteins, with the latter being preferred, and human proteins being especially preferred. 
Particularly useful test compound will be directed to the class of proteins to which the 
target belongs, e.g., substrates for enzymes or ligands and receptors. 

In a preferred embodiment, modulators are peptides of from about 5 to about 30 amino 
acids, with from about 5 to about 20 amino acids being preferred, and from about 7 to 
about 15 being particularly preferred. The peptides are digests of naturally occurring 
proteins as is outlined above, random peptides, or "biased" random peptides. By 
"randomized" or grammatical equivalents herein is meant that each nucleic acid and 
peptide consists of essentially random nucleotides and amino acids, respectively. Since 
generally these random peptides (or nucleic acids, discussed below) are chemically 
synthesized, they may incorporate any nucleotide or amino acid at any position, the 
synthetic process are designed to generate randomized proteins or nucleic acids, to 
allow the formation of all or most of the possible combinations over the length of the 
sequence, thus forming a library of randomized candidate bioactive proteinaceous 
agents. 

In one embodiment, the library is fully randomized, with no sequence preferences or 
constants at any position. In a preferred embodiment, the library is biased. That is, some 
positions within the sequence are either held constant, or are selected from a limited 
number of possibilities. For example, in a preferred embodiment, the nucleotides or 
amino acid residues are randomized within a defined class, e.g., of hydrophobic amino 
acids, hydrophilic residues, sterically biased (either small or large) residues, towards the 
creation of nucleic acid binding domains, the creation of cysteines, for cross-linking, 
prolines for SH-3 domains, serines, threonines, tyrosines or hlstidines for 
phosphorylation sites, etc., or to purines, etc. 
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Modulators of ovarian cancer can also be nucleic acids, as defined below. As described 
above generally for proteins; nucleic acid modulating agents are naturally occurring 
nucleic acids, random nucleic acids, or "biased" random nucleic acids. For example, 
digests of procaryotic or eucaryotic genomes are used as is outlined above for proteins. 

In certain embodiments, the activity of a ovarian cancer-associated protein is 
down-regulated, or entirely inhibited, by the use of antisense polynucleotide, i.e., a 
nucleic acid complementary to, and which can preferably hybridize specifically to, a 
coding mRNA nucleic acid sequence, e.g., a ovarian cancer-associated protein mRNA, 
or a subsequence thereof. Binding of the antisense polynucleotide to the mRNA reduces 
the translation and/or stability of the mRNA. 

In the context of this invention, antisense polynucleotides can comprise 
naturally-occurring nucleotides, or synthetic species formed from naturally-occurring 
subunits or their close homologs. Antisense polynucleotides may also have altered sugar 
moieties or inter-sugar linkages. Exemplary among these are the phosphorothioate and 
other sulfur containing species which are known for use in the art. Analogs are 
comprehended by this invention so long as they function effectively to hybridize with the 
ovarian cancer-associated protein mRNA. See, e.g., Isis Pharmaceuticals, Carlsbad, CA; 
Sequitor, Inc., Natick, MA. 

Such antisense polynucleotides can readily be synthesized using recombinant means, or 
are synthesized in vitro. Equipment for such synthesis is sold by several vendors, 
including Applied Biosystems. The preparation of other oligonucleotides such as 
phosphorothioates and alkylated derivatives is also well known to those of skill in the art 

Antisense molecules as used herein include antisense or sense oligonucleotides. Sense 
oligonucleotides can, e.g., be employed to block transcription by binding to the 
anti-sense strand. The antisense and sense oligonucleotide comprise a single-stranded 
nucleic acid sequence (either RNA or DNA) capable of binding to target mRNA (sense) 
or DNA (antisense) sequences for ovarian cancer molecules. Antisense or sense 
oligonucleotides, according to the present invention, comprise a fragment generally at 
least about 14 nucleotides, preferably from about 14 to 30 nucleotides. The ability to 
derive an antisense or a sense oligonucleotide, based upon a cDNA sequence encoding 
a given protein is described in, e.g., Stein & Cohen (Cancer Res. 48:2659 (1988 and van 
der Krol et al. (BioTechniques 6:958 (1988)). 
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In addition to antisense polynucleotides, ribozymes are used to target and inhibit 
transcription of ovarian cancer-associated nucleotide sequences. A ribozyme is an RNA 
molecule that catalytically cleaves other RNA molecules. Different kinds of ribozymes 
have been described, including group I ribozymes, hammerhead ribozymes, hairpin 
ribozymes, RNase P, and axhead ribozymes (see, e.g., Castanotto et al., Adv. in 
Pharmacology 25: 289-317 (1994) for a general review of the properties of different 5 
ribozymes). 

Methods of preparing ribozymes are well known to those of skill in the art (see, e.g., WO 
94/26877; Ojwang et al., Proc. Natl. Acad. Sci. USA 90:6340-6344 (1993); Yamada et 
al., Human Gene Therapy 1:39-45 (1994); Leavitt et al., Proc. Natl. Acad. Sci. USA 
92:699- 703 (1995); Leavitt et al., Human Gene Therapy 5:1151-120 (1994); and 
Yamada et al., Virology 205: 121-126(1994)). 

Polynucleotide modulators of ovarian cancer are introduced into a cell containing the 
target nucleotide sequence by formation of a conjugate with a ligand binding molecule, 
as described in WO 91/04753. Suitable ligand binding molecules include, but are not 
limited to, cell surface receptors, growth factors, other cytokines, or other ligands that 
bind to cell surface receptors. Preferably, conjugation of the ligand binding molecule 
does not substantially interfere with the ability of the ligand binding molecule to bind to its 
corresponding molecule or receptor, or block entry of the sense or antisense 
oligonucleotide or its conjugated version into the cell. Alternatively, a polynucleotide 
modulator of ovarian cancer are introduced into a cell containing the target nucleic acid 
sequence, e.g., by formation of an polynucleotide-lipid complex, as described in WO 
90/10448. It is understood that the use of antisense molecules or knock out and knock in 
models may also be used in screening assays as discussed above, in addition to 
methods of treatment 

As noted above, gene expression monitoring is conveniently used to test candidate 
modulators (e.g., protein, nucleic acid or small molecule). After the candidate agent has 
been added and the cells allowed to incubate for some period of time, the sample 
containing a target sequence to be analyzed is added to the biochip. If required, the 
target sequence is prepared using known techniques. For example, the sample are 
treated to lyse the cells, using known lysis buffers, electroporation, etc., with purification 
and/or amplification such as PCR performed as appropriate. For example, an in vitro 
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transcription with labels covalently attached to the nucleotides is performed. Generally, 
the nucleic acids are labeled with piotin-FITC or PE, or with cy3 or cy5. 

In a preferred embodiment, the. target sequence is labeled with, e.g., a fluorescent, a 
chemiluminescent a chemical, or a radioactive signal, to provide a means of detecting 
the target sequence's specific binding to a probe. The label also are an enzyme, such as, 
alkaline phosphatase or horseradish peroxidase, which when provided with an 
appropriate substrate produces a product that are detected. Alternatively, the label are a 
labeled compound or small molecule, such as an enzyme inhibitor, that binds but is not 
catalyzed or altered by the enzyme. The label also are a moiety or compound, such as, 
an epitope tag or biotin which specifically binds to streptavidin. For the example of biotin, 
the streptavidin is labeled as described above, thereby, providing a detectable signal for 
the bound target sequence. Unbound labeled streptavidin is typically removed prior to 
analysis. 

As will be appreciated by those in the art, these assays are direct hybridization assays or 
can comprise "sandwich assays", which include the use of multiple probes, as is 
generally outlined in U.S. Patent Nos. 5,681,702, 5,597,909, 5,545,730, 5,594,117, 
5,591,584, 5,571,670, 5,580,731, 5,571,670, 5,591,584, 5,624,802, 5,635,352, 
5,594,118. 5,359,100, 5,124,246 and 5,681,697, all of which are hereby incorporated by 
reference. In this embodiment, in general, the target nucleic acid is prepared as outlined 
above, and then added to the bidchip comprising a plurality of nucleic acid probes, under 
conditions that allow the formation of a hybridization complex. 

A variety of hybridization conditions are used in the present invention, including high, 
moderate and low stringency conditions as outlined above. The assays are generally run 
under stringency conditions which allows formation of the label probe hybridization 
complex only in the presence of target Stringency are controlled by altering a step 
parameter that is a thermodynamic variable, including, but not limited to. temperature, 
formamide concentration, salt concentration, chaotropic salt concentration pH, oiganic 
solvent concentration, etc. 



These parameters may also be used to control non-specific binding, as is generally 
outlined in U.S. Patent No. 5,681,697. Thus it are desirable to perform certain steps at 
35 higher stringency conditions to reduce non-specific binding. 
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The reactions outlined herein are accomplished in a variety of ways. Components of the 
reaction are added simultaneously, or sequentially, in different orders, with preferred 
embodiments outlined below. In addition, the reaction may include a variety of other 
reagents. These include salts, buffers, neutral proteins, e.g. albumin, detergents, etc. 
5 which are used to facilitate optimal hybridization and detection, and/or reduce non- 
specific or background interactions. Reagents that otherwise improve the efficiency of the 
assay, such as protease inhibitors, nuclease inhibitors, antknicrobial agents, etc., may 
also be used as appropriate, depending on the sample preparation methods and purity of 
the target. 

10 

The assay data are analyzed to determine the expression levels, and changes in 
expression levels as between states, of individual genes, forming a gene expression 
profile, 

15 Screens are performed to identify modulators of the ovarian cancer phenotype. In one 
embodiment, screening is performed to identify modulators that can induce or suppress a 
particular expression profile, thus preferably generating the associated phenotype. In 
another embodiment, e.g., for diagnostic applications, having identified differentially 
expressed genes important in a particular state, screens are performed to identify 

20 modulators that alter expression of individual genes. In an another embodiment, 
screening is performed to identify modulators that alter a biological function of the 
expression product of a differentially expressed gene. Again, having identified the 
importance of a gene in a particular state, screens are performed to identify agents that 
bind and/or modulate the biological activity of the gene product 

25 

In addition screens are done for genes that are induced in response to a candidate 
agent. After identifying a modulator based upon its ability to suppress a ovarian cancer 
expression pattern leading to a normal expression pattern, or to modulate a single 
ovarian cancer gene expression profile so as to mimic the expression of the gene from 

30 normal tissue, a screen as described above are performed to identify genes that are 
specifically modulated in response to the agent. Comparing expression profiles between 
normal tissue and agent treated ovarian cancer tissue reveals genes that are not 
expressed in normal tissue or ovarian cancer tissue, but are expressed in agent treated 
tissue. These agent-specific sequences are identified and used by methods described 

35 herein for ovarian cancer genes or proteins. In particular these sequences and the 
proteins they encode find use in marking or identifying agent treated cells. In addition, 
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antibodies are raised against the agent induced proteins and used to target novel 
therapeutics to the treated ovarian cancer tissue sample. 

Thus, in one embodiment, a test compound is administered to a population of ovarian 
5 cancer cells, that have an associated ovarian cancer expression profile. By 
"administration" or "contacting" herein is meant that the candidate agent is added to the 
cells in such a manner as to allow the agent to act upon the cell, whether by uptake and 
intracellular action, or by action at the cell surface. In some embodiments, nucleic acid 
encoding a proteinaceous candidate agent (i.e., a peptide) are put into a viral construct 
10 such as an adenoviral or retroviral construct, and added to the cell, such that expression 
of the peptide agent is accomplished. Regulatable gene administration systems can also 
be used. 

Once the test compound has been administered to the cells, the cells are washed if 
15 desired and are allowed to incubate under preferably physiological conditions for some 
period of time. The cells are then harvested and a new gene expression profile is 
generated, as outlined herein. 

Thus, e.g., ovarian cancer tissue are screened for agents that modulate, e.g., induce or 
20 suppress the ovarian cancer phenotype. A change in at least one gene, preferably many, 
of the expression profile indicates that the agent has an effect on ovarian cancer activity. 
By defining such a signature for the ovarian cancer phenotype, screens for new drugs 
that alter the phenotype are devised. With this approach, the drug target need not be 
known and need not be represented in the original expression screening platform, nor 
25 does the level of transcript for the target protein need to change. 

In a preferred embodiment, as outlined above, screens are done on individual genes and 
gene products (proteins). That is, having identified a particular differentially expressed 
gene as important in a particular state, screening of modulators of either the expression 

30 of the gene or the gene product itself are done. The gene products of differentially 
expressed genes are sometimes referred to herein as "ovarian cancer-associated 
proteins" or a "ovarian cancer modulatory protein". The ovarian cancer modulatory 
protein are a fragment, or alternatively, be the full length protein to the fragment encoded 
by the nucleic acids referred to in Tables 1-3. Preferably, the ovarian cancer modulatory 

35 protein is a fragment In a preferred embodiment, the ovarian cancer amino acid 
sequence which is used to determine sequence identity or similarity is encoded by a 
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nucleic acid referred to in Tables 1-3. In another embodiment, the sequences are 
naturally occurring allelic variants of a protein encoded by a nucleic acid referred to in 
Tables 1-3. In another embodiment, the sequences are sequence variants as further 
described herein. 

5 

Preferably, the ovarian cancer modulatory protein is a fragment of approximately 14 to 24 
amino acids long. More preferably the fragment is a soluble fragment. Preferably, the 
fragment includes a non-transmembrane region. In a preferred embodiment, the 
fragment has an N-terminal Cys to aid in solubility. In one embodiment, the C-terminus of 
10 the fragment is kept as a free acid and the N-terminus is a free amine to aid in coupling, 
i.e., to cysteine. 

In one embodiment the ovarian cancer-associated proteins are conjugated to an 
immunogenic agent as discussed herein. In one embodiment the ovarian cancer- 
15 associated protein is conjugated to BSA. 

Measurements of ovarian cancer polypeptide activity, or of ovarian cancer or the ovarian 
cancer phenotype are performed using a variety of assays. For example, the effects of 
the test compounds upon the function of the ovarian cancer polypeptides are measured 

20 by examining parameters described above. A suitable physiological change that affects 
activity are used to assess the influence of a test compound on the polypeptides of this 
invention. When the functional consequences are determined using intact cells or 
animals, one can also measure a variety of effects such as, in the case of ovarian cancer 
associated with tumours, tumour growth, tumour metastasis, neovascularization, 

25 hormone release, transcriptional changes to both known and uncharacterized genetic 
markers (e.g., northern blots), changes in cell metabolism such as cell growth or pH 
changes, and changes in intracellular second messengers such as cGMP. In tire assays 
of the invention, mammalian ovarian cancer polypeptide is typically used, e.g., mouse, 
preferably human. 

30 

Assays to identify compounds with modulating activity are performed in vitro. For 
example, a ovarian cancer polypeptide is first contacted with a potential modulator and 
incubated for a suitable amount of time, e.g., from 0.5 to 48 hours. In one embodiment, 
the ovarian cancer polypeptide levels are determined in vitro by measuring the level of 
35 protein or mRNA. The level of protein is measured using immunoassays such as western 
blotting, ELISA and the like with an antibody that selectively binds to the ovarian cancer 
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polypeptide or a fragment thereof. For measurement of mRNA, amplification, e.g., using 
PCR, LCR, or hybridization assays, e.g., northern hybridization, RNAse protection, dot 
blotting, are preferred. The level of protein or mRNA is detected using directly or 
indirectly labeled detection agents, e.g., fluorescently or radioactively labeled nucleic 
5 acids, radioactively or enzymatically labeled antibodies, and the like, as described herein. 

Alternatively, a reporter gene system are devised using the ovarian cancer-associated 
protein promoter operably linked to a reporter gene such as luciferase, green fluorescent 
protein, CAT, or (beta-gal. The reporter construct is typically transfected into a cell. After 
10 treatment with a potential modulator, the amount of reporter gene transcription, 
translation, or activity is measured according to standard techniques known to those of 
skill in the art. 

In a preferred embodiment, as outlined above, screens are done on individual genes and 
15 gene products (proteins). That is, having identified a particular differentially expressed 
gene as important in a particular state, screening of modulators of the expression of the 
gene or the gene product itself are done. The gene products of differentially expressed 
genes are sometimes referred to herein as "ovarian cancer-associated proteins." The 
ovarian cancer-associated protein are a fragment, or alternatively, be the full length 
20 protein to a fragment shown herein. 

In one embodiment, screening for modulators of expression of specific genes is 
performed. Typically, the expression of only one or a few genes are evaluated. In another 
embodiment, screens are designed to first find compounds that bind to differentially 
25 expressed proteins. These compounds are then evaluated for the ability to modulate 
differentially expressed activity. Moreover, once initial candidate compounds are 
identified, variants are further screened to better evaluate structure activity relationships. 

In a preferred embodiment, binding assays are done. In general, purified or isolated gene 
30 product is used; that is, the gene products of one or more differentially expressed nucleic 
acids are made. For example, antibodies are generated to the protein gene products, 
and standard immunoassays are run to determine the amount of protein present 
Alternatively, cells comprising the ovarian cancer-associated proteins are used in the 
assays. 
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Thus, in a preferred embodiment, the methods comprise combining a ovarian cancer- 
associated protein and a candidate compound, and determining the binding of the. 
compound to the ovarian cancer-associated protein. Preferred embodiments utilize the 
human ovarian cancer-associated protein, although other mammalian proteins may also 
5 be used, e.g. for the development of animal models of human disease. In some 
embodiments, as outlined herein, variant or derivative ovarian cancer-associated 
proteins are used. 

Generally, in a preferred embodiment of the methods herein, the ovarian cancer- 

10 associated protein or the candidate agent is non-diffusably bound to an insoluble support 
having isolated sample receiving areas (e.g. a microtiter plate, an array, etc.). The 
insoluble supports are made of any composition to which the compositions are bound, is 
readily separated from soluble material, and is otherwise compatible with the overall 
method of screening. The surface of such supports are solid or porous and of any 

15 convenient shape. Examples of suitable insoluble supports include microtiter plates, 
arrays, membranes and beads. These are typically made of glass, plastic (e.g., 
polystyrene), polysaccharides, nylon or nitrocellulose, teflon™, etc. microtitre plates and 
arrays are especially convenient because a large number of assays are carried out 
simultaneously, using small amounts of reagents and samples. The particular manner of 

20 binding of the composition is not crucial so long as it is compatible with the reagents and 
overall methods of the invention, maintains the activity of the composition and is 
nondiffusable. Preferred methods of binding include the use of antibodies (which do not 
sterically block either the ligand binding site or activation sequence when the protein is 
bound to the support), direct binding to "sticky" or ionic supports, chemical crosslinking, 

25 the synthesis of the protein or agent on the surface, etc. Following binding of the protein 
or agent, excess unbound material is removed by washing. The sample receiving areas 
may then be blocked through incubation with bovine serum albumin (BSA), casein or 
other innocuous protein or other moiety. 

30 In a preferred embodiment, the ovarian cancer-associated protein is bound to the 
support, and a test compound is added to the assay. Alternatively, the candidate agent is 
bound to the support and the ovarian cancer-associated protein is added. Novel binding 
agents include specific antibodies, non-natural binding agents identified in screens of 
chemical libraries, peptide analogs, etc. Of particular interest are screening assays for 

35 agents that have a low toxicity for human cells. A wide variety of assays are used for this 
purpose, including labeled in vitro protein-protein binding assays, electrophoretic mobility 
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shift assays, immunoassays for protein binding, functional assays (phosphorylation 
assays, etc.) and the like. 

The determination of the binding of the test modulating compound to the ovarian cancer- 
associated protein are done in a number of ways. In a preferred embodiment, the 
compound is labeled, and binding determined directly, e.g., by attaching all or a portion 
of the ovarian cancer-associated protein to a solid support, adding a labeled candidate 
agent (e.g., a fluorescent label), washing off excess reagent, and determining whether 
the label is present on the solid support. Various blocking and washing steps are utilized . 
as appropriate. 

In some embodiments, only one of the components is labeled, e.g., the proteins (or 
proteinaceous candidate compounds) are labeled. Alternatively, more than one 
component are labeled with different labels, e.g., 125 l for the proteins and a fluorophor for 
the compound. Proximity reagents, e.g., quenching or energy transfer reagents are also 
useful. 

In one embodiment, the binding of the test compound is determined by competitive 
binding assay. The competitor is a binding moiety known to bind to the target molecule 
(i.e., a ovarian cancer-associated protein), such as an antibody, peptide, binding partner, 
ligand, etc. Under certain circumstances, there are competitive binding between the 
compound and the binding moiety, with the binding moiety displacing the compound. In 
one embodiment the test compound is labeled. Either the compound, or the competitor, 
or both, is added first to the protein for a time sufficient to allow binding, if present. 
Incubations are performed at a temperature which facilitates optimal activity, typically 
between 4 and 40°C. Incubation periods are typically optimized, e.g., to facilitate rapid 
high throughput screening. Typically between 0.1 and 1 hour will be sufficient. Excess 
reagent is generally removed or washed away. The second component is then added, 
and the presence or absence of the labeled component is followed, to indicate binding. 

In a preferred embodiment, the competitor is added first followed by the test compound. 
Displacement of the competitor is an indication that the test compound is binding to the 
ovarian cancer-associated protein and thus is capable of binding to, and potentially 
modulating, the activity of the ovarian cancer-associated protein. In this embodiment 
either component are labeled. Thus, e.g., if the competitor is labeled, the presence of 
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label in the wash solution indicates displacement by the agent. Alternatively, if the test 
compound is labeled, the presence of the label on the support indicates displacement. 

In an alternative preferred embodiment, the test compound is added first, with incubation 
5 and washing, followed by the competitor. The absence of binding by the competitor may 
indicate that the test compound is bound to the ovarian cancer-associated protein with a 
higher affinity. Thus, if the test compound is. labeled, the presence of the label on the 
support, coupled with a lack of competitor binding, may indicate that the test compound 
is capable of binding to the ovarian cancer-associated protein. 

10 

In a preferred embodiment, the methods comprise differential screening to identity 
agents that are capable of modulating the activity of the ovarian cancer-associated 
proteins. In this embodiment, the methods comprise combining a ovarian cancer- 
associated protein and a competitor in a first sample. A second sample comprises a test 

15 compound, a ovarian cancer-associated protein, and a competitor. The binding of the 
competitor is determined for both samples, and a change, or difference in binding 
between the two samples indicates the presence of an agent capable of binding to the 
ovarian cancer-associated protein and potentially modulating its activity. That is, if the f 
binding of the competitor is different in the second sample relative to the first sample, the 

20 agent is capable of binding to the ovarian cancer-associated protein. 

Alternatively, differential screening is used to identify drug candidates that bind to the 
native ovarian cancer-associated protein, but cannot bind to modified ovarian cancer- 
associated proteins. The structure of the ovarian cancer-associated protein are modeled, 
25 and used in rational drug design to synthesize agents that interact with that site. Drug 

candidates that affect the activity of a ovarian cancer-associated protein are also j 

identified by screening drugs for the ability to either enhance or reduce the activity of the 

protein. I 

30 Positive controls and negative controls are used in the assays. Preferably control and 
test samples are performed in at least triplicate to obtain statistically significant results. 
Incubation of all samples is for a time sufficient for the binding of the agent to the protein. 
Following incubation, samples are washed free of non-specifically bound material and 
the amount of bound, generally labeled agent determined. For example, where a 

35 radiolabel is employed, the samples are counted in a scintillation counter to determine 
the amount of bound compound. 
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A variety of other reagents are included in the screening assays. These include reagents 
like salts, neutral proteins, e.g. albumin, detergents, etc. which are used to facilitate 
optimal protein-protein binding and/or reduce non-specific or background interactions. 
5 Also reagents that otherwise improve the efficiency of the assay, such as protease 
inhibitors, nuclease inhibitors, anti-microbial agents, etc., are used. The mixture of 
components are added in an order that provides for the requisite binding. 

In a preferred embodiment, the invention provides methods for screening for a compound 
10 capable of modulating the activity of a ovarian cancer-associated protein. The methods 
comprise adding a test compound, as defined above, to a cell comprising ovarian cancer- 
associated proteins. Preferred cell types include almost any cell. The cells contain a 
recombinant nucleic acid that encodes a ovarian cancer-associated protein. In a 
preferred embodiment, a library of candidate agents are tested on a plurality of cells. 

15 

In one aspect, the assays are evaluated in the presence or absence or previous or 
subsequent exposure of physiological signals, e.g. hormones, antibodies, peptides, 
antigens, cytokines, growth factors, action potentials, pharmacological agents including 
chemotherapeutics, radiation, carcinogenics, or other cells (i.e. cell-cell contacts). In 
20 another example, the determinations are determined at different stages of the cell cycle 
process. 

In this way, compounds that modulate ovarian cancer agents are identified. Compounds 
with pharmacological activity are able to enhance or interfere with the activity of the 
25 ovarian cancer-associated protein. Once identified, similar structures are evaluated to 
* identify critical structural feature of the compound. 

In one embodiment, a method of inhibiting ovarian cancer cell division is provided. The 
method comprises administration of a ovarian cancer inhibitor. In another embodiment a 
30 method of inhibiting ovarian cancer is provided. The method comprises administration of 
a ovarian cancer inhibitor. In a further embodiment, methods of treating cells or 
individuals with ovarian cancer are provided. The method comprises administration of a 
ovarian cancer inhibitor, 

35 In one embodiment, a ovarian cancer inhibitor is an antibody as discussed above. In 
another embodiment, the ovarian cancer inhibitor is an antisense molecule. 
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A variety of cell growth, proliferation, and metastasis assays are known to those of skill in 
the art, as described below. 

5 Soft agar growth or colony formation in suspension 

Normal cells require a solid substrate to attach and grow. When the cells are 
transformed, they lose this phenotype and grow detached from the substrate. For 
example, transformed cells can grow in stirred suspension culture or suspended in 
semi-solid media, such as semi-solid or soft agar. The transformed cells, when 

10 transfected with tumour suppressor genes, regenerate normal phenotype and require a 
solid substrate to attach and grow. Soft agar growth or colony formation in suspension 
assays are used to identify modulators of ovarian cancer sequences, which when 
expressed in host cells, inhibit abnormal cellular proliferation and transformation. A 
therapeutic compound would reduce or eliminate the host cells' ability to grow in stirred 

15 suspension culture or suspended in semisolid media, such as semi-solid or soft. 

Techniques for soft agar growth or cojony formation in suspension assays are described 
in Freshney, Culture of Animal Cells a Manual of Basic Technique (3rd ed„ 1994), herein 
incorporated by reference. See also, the methods section of Garkavtsev et al. (I996), 
20 supra, herein incorporated by reference. 

Contact inhibition and density limitation of growth 

Normal cells typically grow in a flat and organized pattern in a petri dish until they touch 
other cells. When the cells touch one another, they are contact inhibited and stop 

25 growing. When cells are transformed, however, the cells are not contact inhibited and 
continue to grow to high densities in disorganized foci. Thus, the transformed cells grow 
to a higher saturation density than normal cells. This are detected morphologically by the 
formation of a disoriented monolayer of cells or rounded cells in foci within the regular 
pattern of normal surrounding cells. Alternatively, labeling index with ( 3 H)-thymidine at 

30 saturation density are used to measure density limitation of growth. See Freshney 
(1994), supra. The transformed cells, when transfected with tumour suppressor genes, 
regenerate a normal phenotype and become contact inhibited and would grow to a lower 
density. 

35 In this assay, labeling index with ( 3 H)-thymidine at saturation density is a preferred 
method of measuring density limitation of growth. Transformed host cells are transfected 
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with a ovarian cancer-associated sequence and are grown for 24 hours at saturation 
density in non-limiting medium conditions. The percentage of cells labeling with 
( 3 H)-thymidine is determined autoradiographically. See, Freshney (1994), supra. 

5 Growth factor or serum dependence 

Transformed ceils have a lower serum dependence than their normal counterparts (see, 
e.g., Temin, J. Natl Cancer Insti. 37:167-175 (1966); Eagle et a/., J. Exp. Med. 
131:836-879 (1970)); Freshney, supra, this is in part due to release of various growth 
factors by the transformed cells. Growth factor or serum dependence of transformed host 

10 cells are compared with that of control. Tumor specific markers levels Tumor cells 
release an increased amount of certain factors (hereinafter "tumour specific markers") 
than their normal counterparts. For example, plasminogen activator (PA) is released from 
human glioma at a higher level than from normal brain cells (see, e.g., Gullino, 
Angiogenesis, tumour vascularization, and potential interference with tumour growth, in 

15 Biological Responses in Cancer, pp. 1 78- 1 84 (Mihich (ed.) 1985)). Similarly, Tumor 
angiogenesis factor (TAF) is released at a higher level in tumour cells than their normal 
counterparts. See, e.g., Folkman, Angiogenesis and Cancer, Sem Cancer BioL (1992)). 
Various techniques which measure the release of these factors are described in 
Freshney (1994), supra. Also, see, Unkless et al. , J. Biol. Chem. 249:4295-4305 (1974); 

20 Strickland & Beers, J. Biol. Chem. 251:5694-5702 (1976); Whur et al., Br. J. Cancer 
42:305 312 (1980); Gullino, Angiogenesis, tumour vascularization, and potential 
interference with tumour growth, in Biological Responses in Cancer, pp. 178-184 (Mihich 
(ed.) 1985); Freshney Anticancer Res. 5:111-130 (1985). 

25 Invasiveness into Matrigel 

The degree of invasiveness into Matrigel-or some other extracellular matrix constituent 
are used as an assay to identify compounds that modulate ovarian cancer-associated 
sequences. Tumor cells exhibit a good correlation between malignancy and invasiveness 
of cells into Matrigel or some other extracellular matrix constituent In this assay, 

30 tumourigenic cells are typically used as host cells. Expression of a tumour suppressor 
gene in these host cells would decrease invasiveness of the host cells. 

Techniques described in Freshney (1994), supra, are used. Briefly, the level of invasion 
of host cells are measured by using filters coated with Matrigel or some other 
35 extracellular matrix constituent. Penetration into the gel, or through to the distal side of 
the filter, is rated as invasiveness, and rated histologically by number of cells and 
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distance moved, or by prelabeling the cells with 125 1 and counting the radioactivity on 
the distal side of the filter or bottom of the dish. See, e.g., Freshney (1984), supra. 

Tumor growth in vivo 

5 Effects of ovarian cancer-associated sequences on cell growth are tested in transgenic 
or immune-suppressed mice. Knock-out transgenic mice are made, in which the ovarian 
cancer gene is disrupted or in which a ovarian cancer gene is inserted. Knock- out 
transgenic mice are made by insertion of a marker gene or other heterologous gene into 
the endogenous ovarian cancer gene site in the mouse genome via homologous 
10 recombination. Such mice can also be made by substituting the endogenous ovarian 
cancer gene with a mutated version of the ovarian cancer gene, or by mutating the 
endogenous ovarian cancer gene, e.g., by exposure to carcinogens. 

A DNA construct is introduced into the nuclei of embryonic stem cells. Cells containing 
15 the newly engineered genetic lesion are injected into a host mouse embryo, which is 
re-implanted into a recipient female. Some of these embryos develop into chimeric mice 
that possess germ cells partially derived from the mutant cell line. Therefore, by breeding 
the chimeric mice it is possible to obtain a new line of mice containing the introduced 
genetic lesion (see, e.g., Capecchi et al., Science 244:1288 (1989)). Chimeric targeted 
20 mice are derived according to Hogan et aL, Manipulating the Mouse Embryo: A 
Laboratory Manual, Cold Spring Harbor Laboratory (1988) and Teratocarcinomas and 
Embryonic Stem Cells: A Practical Approach, Robertson, ed. ( IRL Press, Washington, 
D.C., (1987). 



25 Alternatively, various immune-suppressed or immune-deficient host animals are used. 
For example, genetically athymic "nude" mouse (see, e.g., Giovanella et al., J. Natl. 
Cancer Inst. 52:921 (1974)), a SCID mouse, a thymectomized mouse, or an irradiated 
mouse (see, e.g., Bradley et al., Br. J. Cancer 38:263 (1978); Selby et al., Br. J. Cancer 
41:52 (1980)) are used as a host. Transplantable tumour cells (typically about 10 8 cells) 

30 injected into isogenic hosts will produce invasive tumours in a high proportions of cases, 
while normal cells of similar origin will not. In hosts which developed invasive tumours, 
cells expressing a ovarian cancer-associated sequences are injected subcutaneously. 
After a suitable length of time, preferably 4 to 8 weeks, tumour growth is measured (e.g. 
by volume or by its two largest dimensions) and compared to the control. Tumours that 

35 have a statistically significant reduction (using, e.g. Student's T test) are said to have 
inhibited growth. 
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Administration 

therapeutic reagents of the invention are administered to patients, therapeutically. 
Typically, such proteins/polynucleotides and substances may preferably be combined 
5 with various components to produce compositions of the invention. Preferably the 
compositions are combined with a pharmaceutical^ acceptable carrier or diluent to 
produce a pharmaceutical composition (which are for human or animal use). Suitable 
carriers and diluents include isotonic saline solutions, for example phosphate-buffered 
saline. The composition of the invention are administered by direct injection. The 
10 composition are formulated for parenteral, intramuscular, intravenous, subcutaneous, 
intraocular, oral, vaginal or transdermal administration. Typically, each protein are 
administered at a dose of from 0.01 to 30 mg/kg body weight, preferably from 0.1 to 10 
mg/kg, more preferably from 0.1 to 1 mg/kg body weight. 

15 Polynucleotides/vectors encoding polypeptide components for use in modulating the 
activity of the ovarian cancer-associated proteins/polynucleotides are administered 
directly as a naked nucleic acid construct When the polynucleotides/vectors are 
administered as a naked nucleic acid, the amount of nucleic acid administered may 
typically be in the range of from 1 pg to 10 mg, preferably from 100 to 1 mg. 

20 

Uptake of naked nucleic acid constructs by mammalian cells is enhanced by several 
known transfection techniques for example those including the use of transfection 
agents. Example of these agents include cationic agents (for example calcium phosphate 
and DEAE-dextran) and lipofectants (for example lipofectam™ and transfectam™). 
25 Typically, nucleic acid constructs are mixed with the transfection agent to produce a 
composition. 

Preferably the polynucleotide or vector of the invention is combined with a 
pharmaceutical^ acceptable carrier or diluent to produce a pharmaceutical composition. 
30 Suitable carriers and diluents include isotonic saline solutions, for example phosphate- 
buffered saline. The composition are formulated for parenteral, intramuscular, 
intravenous, subcutaneous, oral, intraocular or transdermal administration. 

The pharmaceutical compositions are administered in a range of unit dosage forms 
35 depending on the method of administration. For example, unit dosage forms suitable for 
oral administration include, powder, tablets, pills, capsules and lozenges. Orally 
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administered dosage forms will typically be formulated to protect the active ingredient 
from digestion and may therefore be complexed with appropriate carrier molecules 
and/or packaged in an appropriately resistant carrier. Suitable carrier molecules and 
packaging materials/barrier materials are known in the art. 

5 

The compositions of the invention are administered for therapeutic or prophylatic 
treatments. In therapeutic applications, compositions are administered to a patient 
suffering from a disease (e.g. ovarian cancer) in an amount sufficient to cure or at least 
partially ameliorate the disease and its complications. An amount adequate to 
10 accomplish this is defined as a "therapeutically effective dose". An amount of the 
composition that is capable of preventing or slowing the development of cancer in a 
patient is referred to as a "prophylactically effective dose". 

The routes of administration and dosages described are intended only as a guide since a 
15 skilled practitioner will be able to determine readily the optimum route of administration 
and dosage for any particular patient and condition. 

The present invention is further described with reference to the accompanying drawings 
and the following non-limiting examples. 

20 
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EXAMPLE 1 

Gene expression profiling to identify differentially-expressed 
genes in ovarian cancer 

1. Tissue Bank and Database 

5 Tissue was collected from patients undergoing treatment at the GCC, we have established 
an Ovarian Cancer Tissue Bank and Clinical Database that currently holds data on over 
400 cases treated at the GCC between 1986 and 2002. Tissue (currently 149 fresh/frozen 
and 292 archival fixed paraffin-embedded samples) was acquired from patients undergoing 
cytoreductive surgery and does not interfere with the collection of tissue for the normal 

10 processing of diagnostic specimens. Patient consent, included in all our studies, was 
collected prior to surgery. Tissue specimens and their associated pathology reports were 
coded in order to maintain patient confidentiality. Uncoded data was electronically and/or 
physically locked with restricted access by appropriate senior investigators only. Clinical 
^ (diagnosis, treatment, residual disease) and pathological data (tumour grade, stage) were 

15 collected and updated (disease recurrence, patient survival) at regular intervals. This study 
has ethical approval from the South Eastern Sydney Area Health Service Research Ethics 
Committee, Australia. Clinical data and tissue collection are ongoing. 

2. Genetic profiling of ovarian cancers 

20 In order to identify those genes differentially regulated in epithelial ovarian cancer 51 
ovarian cancer tumor samples were manually dissected from biological samples derived 
from subjects undergoing cytoreductive surgery. These samples comprised 8 
endometrioid tumors, 4 mucinous tumors and 31 serous epithelial ovarian tumors, 12 
corresponding omental deposits and 8 borderline (low-malignant potential) tumors. 

25 

RNA was isolated from the tumor samples in addition to 4 normal ovary samples using 
Trizol reagent (Life Technologies, Rockville, MD, USA) essentially according to 
manufacturer's instructions. RNA was then reverse transcribed using an oligo(dT) 
anchored oligonucleotide that additionally comprised a 77 promoter sequence. Isolated 
30 cDNA was then transcribed in vitro using the 77 MEGAscript kit (Ambion, Austin, TX, 
USA) according to manufacturer's instructions. Transcription was performed with 
biotinylated nucleotides (Bio-11-CTP and Bio-16-UTP) to enable detection of the 
transcribed cRNA. 

35 Levels of gene expression in the cancer samples was then determined by analysing the 
transcribed cDNA samples using customized Affymetrix GeneChip® microarrays that 
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comprise 59,618 oligonucleotide probe sets. These probe sets facilitate analysis of 
46,000 gene clusters, representing over 90% of the predicted expressed human genome. 

Data were normalized, and changes in gene expression detected using a ranked 
5 penalized t-statistic with p-values adjusted for multiple testing using the Holm procedure. 
Analysis was performed using the LIMMA package (available from Bioconductor, 
Biostatistics Unit of the Dana Farber Cancer Institute at the Harvard Medical 
School/Harvard School of Public Health). 

10 Gene expression in 186 samples representing 52 different tissues of the body was also 
determined using the previously described methods to facilitate the identification of 
changes in gene expression that are specific for ovarian cancer. 

Using this method 284 up-regulated transcripts and 186 down-regulated transcripts were 
15 identified. 

In order to determine the efficacy of such a method of analysis for determining gene 
expression changes associated with ovarian cancer, those genes identified were 
compared to results of published expression profile studies. Using this method, 71 
20 genes were identified in the present study that had been previously identified, including, 
for example, genes known to be over-expressed in ovarian cancer, such as, for example 
MUC1 and E-cadherin. 

The ovarian cancer-associated genes and proteins set forth in Table 1 include 
25 sequences that are up-regulated or down-regulated in ovarian cancer subjects, including 
subjects suffering specifically from serous, encodmetrioid, mucinous or clear cell ovarian 
cancer, or non-invasive (borderline) ovarian cancers of any phenotype, and subjects that 
suffered from recurrences of ovarian cancer in the medium term, or died within the 
medium term. 

30 

Data presented in Table 2 indicate those genes that are expressed at significantly higher 
levels or significantly reduced levels in patients suffering from serous cancer relative tot 
he level of expression of the same genes in a normal or healthy subject 
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EXAMPLE2 

Validation of gene expression profiling results using tissue microarrays 
Each of the transcripts identified as being differentially-expressed specifically in ovarian 
cancer was then further analysed using in situ hybridization or immunohistochemical 
staining of tissue microarrays constructed from a large cohort of primary ovarian tumor 
tissue. Such analysis confirms upregulation, down-regulation or total loss of expression 
of the transcripts identified in the microarray analysis of tumor samples. 

Furthermore, as each of the samples in the tissue microarray have been 
clinicopathologically characterized (for example to identify cancer grade and/or disease 
stage) and the subjects from whom the tumors were isolated continuously monitored (to 
detect for example, death or relapse of cancer), changes with gene expression were also 
analysed for correlation with such parameters in order to determine predictive changes in 
gene expression. 

The relative intensity and percentage of cells staining was determined and evaluated for 
associations with clinical stage and grade of disease and disease relapse using the 
Kaplan Meier method and log-rank test, and by univariate and bivariate analyses in a 
Cox proportional hazards model for gene expression and other clinical and pathologic 
predictors of outcome to determine the potential independent prognostic value of the 
markers being assessed. 

Immunohistochemical analysis has been performed on several genes identified in gene 
profiling analysis of ovarian cancer samples. For example, SOX17, Ep-CAM and daudin 
3 were shown by gene profiling analysis to be specifically up-regulated in ovarian cancer 
compared to normal ovaries (Figure 1 and Figure 2). Using immunohistochemical 
analysis, it was determined that SOX17, Ep-CAM and claudin 3 are upregulated in 
serous cancer, mucinous cancer, endometroid cancer and clear cell ovarian cancer. 

Furthermore, immunohistochemical analysis has been used to analyse the expression of 
several other genes that are specifically upregulated in mucinous ovarian cancer. In 
particular the expression of Ll-cadherin (cadherin 17), meprin alpha and Galectin 4 as 
detected using immunohistochemistry is shown in Figure 3. There was a significant 
increase in protein detected in the mucinous ovarian cancer samples compared to the 
normal ovary sample and serous ovarian cancer sample. 
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Immunohistochemical analysis was also performed to analyse the expression of three 
genes that are known to bis upregulated in ovarian cancer (CA125, MUC-1 and E- 
cadherin) (Rgures 1 and 2). 

5 EXAMPLE 3 

Identification of prognostic markers of ovarian cancer 
Using a classical survival analysis to mine expression profiling data several genes that 
are associated with poor patient outcome (ie death or cancer relapse) have been 
identified (Tables 2 and 3). Such genes have clinical utility as prognostic indicators of 
10 disease. . 

Using detailed clinicopathological and postoperative data on all of the 51 patients 
included in our transcriptional profiling studies, including details of biochemical (eg. rising 
serum CA-125) and/or clinical recurrence of disease and overall survival, expression 
15 profiles were correlates with clinical parameters. 

A preliminary survival analysis was performed on the 33 serous cancers within this 
cohort The median follow-up time for these patients was 25.5 months from the date of 
primary laparotomy to the date of last follow-up or the date of death, and 21 of these 
20 patients (66%) were deceased from causes related to their malignancy. 

Preliminary analysis of the expression profiles of these tumors identified several potential 
gene clusters that were associated with an increased risk of biochemical and clinical 
recurrence and overall survival, including the EDD gene (SEQ ID NO: 63). Exemplary 
25 prognostic markers for detecting ovarian cancer are shown in Tables 1 and 3. Preferred 
markers are indicated in Table 3. 

Using immunohistochemical analysis two genes have been confirmed to be upregulated 
in serous ovarian cancer. In particular, sFRP4, a negative signalling protein of the Wnt 
30 pathway, and SOCS3, a negative signaller of IL-6 induced signalling are specifically 
upregulated in serous ovarian cancer when compared to normal ovarian tissue (Figure 
4A). 

Furthermore, using clinical patient data and correlating this information with gene 
35 expression levels using a Cox proportional hazards model, it has been shown that high 
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expression of sFRP4 correlates with a poor outcome in patients (n=127) with serous 
ovarian cancer (p=0.0056) (Figure 4B). . 

EXAMPLE 4 

Validation of gene expression profiling results using quantitative RT-PCR 
Candidate diagnostic genes are screened by quantitative RT-PCR against ovarian 
cancer cell lines to both validate the transcript profiling data (ie check their up- or down- 
regulation). Candidate diagnostic genes are screened using mRNA isolated from a panel 
of 9 ovarian tumour cell lines. (A2780, SKOV3, OVCAR-3, IGROV-1, CAOV3, OV-90, 
SW626, TOV-21G and TOV-1 12D), in addition to several other tumour cell lines including 
lines derived from breast, prostate and colorectal tumours, and immortalised (non- 
transformed) human ovarian surface epithelial cells and a primary normal breast 
epithelial cell line (184). 

Total RNA is isolated from the normal and tumour cell lines, reverse transcribed into 
cDNA and used as template in a quantitative PCR using a LightCycler system (Roche 
Diagnostics). The relative amount of each gene product is determined by comparison to 
a standard housekeeping gene (GAPDH). 

EXAMPLE 5 

Identification of Novel Genes for Diagnosis of Ovarian Cancer 
We identified candidate genes with diagnostic potential from our list of aberrantly 
regulated genes , by applying the following selection procedure: genes with a good 
transcript profile and low p-value (ie highly significantly up- or down-regulated in ovarian 
cancer, as determined in Example 1); and mapping to areas of the genome that have 
been shown to be amplified or lost in ovarian cancer. Accordingly, it is likely that these 
genes are involved in the development and progression of ovarian cancer (ie putative 
oncogenes and tumour suppressor genes). Additional parameters for analysis included 
known or putative function in oncogenesis (eg signal transduction, regulation of cellular 
proliferation, apoptosis etc); and association with other forms of other tumours. Genes 
identified in this analysis are shown in Table 3. 

One method for the diagnosis of cancer comprises detecting modified DNA shed by the 
developing tumour into the blood stream. This can include the detection of mutations in 
both oncogenes and tumour suppressor genes involved in the development and 
progression of ovarian cancer. Furthermore, it has been recently shown that aberrant 
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methylation of tumour suppressor genes, specifically hypermethylation of their gene 
promoters, frequently accompanies gene silencing in cancers, and indeed in some cases 
appears to be the predominant mechanism of gene silencing. 

Combined with the knowledge of tumour nucleic acids circulating in the blood that reflect 
the biological characteristics of a tumour, the detection of methylation-specific tumour 
suppressor gene signatures for any given tumour type has promise as a specific and 
sensitive molecular test for detecting and monitoring cancer. Aberrant methylation is a 
frequent epigenetic event in epithelial ovarian cancer and many candidate tumour 
suppressor genes of epithelial ovarian cancer have been shown to be hypermethylated in 
epithelial ovarian cancer, such as, for example BRCA1. 

, In particular, expression of the candidate tumor suppressor gene MCC, has been shown 
to be down-regulated in epithelial ovarian cancer compared to normal ovarian tissue. 
MCC appears to be involved in critical cell growth regulatory processes and maps to a 
chromosomal region hypothesised as containing a tumor suppressor gene in ovarian 
cancer. Furthermore, we have identified a CpG island within the predicted promoter 
sequence of the MCC gene, a critical feature of genes that are subject to gene silencing 
by hypermethylation and a known characteristic of tumor suppressor genes. Taken 
together these data strongly implicate MCC as a candidate tumor suppressor gene 
involved in epithelial ovarian cancer. 
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Table 4 



Correlation of expression between normal ovarian surface epithelium (OSE), non- 





CA125 


MUC-1 


E- 
cadherin 


CLDN3 


Ep-CAM 


SOX17 


OSE vs IC 


<0.0001 


<0.0001 


0.7251 


0.6132 


0.1573 


0.0854 


OSE vs. BL 


0.1765 


<0.0001 


0.0307 


0.3633 


0.0005 


0.2287 


OSE vs. CA 


0.5443 


<0.0001 


0.1687 


0.0008 


O.0001 


0.6900 


IC vs. BL 


<0.0001 


<0.0001 . 


0.1116 


0.7849 


0.0913 


0.2530 


IC vs. CA 


<o:oooi 


0.2707 


0.4147 


0.0071 


0.0002 


0.0544 


BLvs.CA 


0.0001 


<0.0001 


0.0615 


O.0001 


0.0011 


0.0152 



Table 5 

Correlation of gene expression with patient outcome (univariate 
analysis ie. t expression alone without the influence of covariates) 

Univariate analysis for cllnicopathological variables and CLDN3, Ep-CAM. SOX17. CA125, MUC1 and E-cadherin 
immunoreactivity with survival and relapse in 156 patients with epithelial ovarian cancer 





Disease Specific Survival 








Variable 


Univariate 




Univariate 




Hazards ratio (95% Ci) 


p-value 


Hazards ratio (95% CI) 


p-value 


Pathological tumor stage — — . 










Stage 1 -3bvs. 3c -4b 


5.89 (3.214-10.79) 


<0.000i 


7.37 (3.26-16.63) 


<aoooi 


Tumor grade 






BL and G1 vs. G2 and G3 

Age 


5.508 (2.745-11.052) 


OXOOI 


7.02 (2.76-17.82) 


<0.0001 


<50 vs. >=50 


0.533 (0.288-0.988) 


0.0458 


0.62 (0.29-1.33) 


0.2221 


Residual Disease 








RD<1cmvs. >=1cm 


4.192(2.671-6.580) 


O.0001 


4.17 (2.30-7.55) 


<0.0001 


CA125 level at diagnosis 






CA125<500vs. >500U/ml 


1.843 (1.102-3.080) 


0.0197 


2.292 (1.19-4.40) 


0.0128 


Performance Status 






PS<1 vs. >1 


0.270(0.133-0.549) 


0.0003 


0.53 (0.16-1.74) 


0.2965 


CLDN3 expression 










Membranous Score 0 vs. >0 


2.794(1.012-7.718) 


0.0474 


2.521 (0.908-6.998) 


0.0758 


Membranous Score <1 vs. >1 


1.309(0.763-2^46) 


0.3285 


1.952(1.103-3.457) 


0.0217 


Ep-CAM expression 








Membranous Score <1vs. >1 


1.460(0.809-2.634) 


0.2093 


2.041 (0.997-4.177) 


0.0509 


Membranous Score <2vs. >2 


1.041 (0.634-1.711) 


0.873 


1.449 (0.845-2.487) 


0.1779 


SOX17 expression 








Nuclear membranous Score Ovs. >0 


0.839 (0.514-1.368) 


0.481 


1.311 (0.728-2.358) 


0.3667 


Nuclear membranous Score <1vs. >1 


1.40/(0.615-3.218) 


0.4183 


1.037 (0.380-2.829) 


0.9437 


CA125 expression 








Membranous apical Score Ovs. >0 


2.581 (1.393-4.781) 


0.0026 


2.725(1.218-6.093) 


0:0146 


Membranous apical Score <1vs. >1 


1.637 (1.045-2.564) 


0.0313 


1.298(0.731-2.307) 


0.3737 


MUC1 expression 








Membranous apical Score Ovs. >0 


2.479 (0.343-17.898) 


0.368 


HA 




Membranous apical Score <1vs. >1 


3.745 (1.176-11.926) 


0.0254 


6.432 (1.562-26.483) 


0.0099 


Membranous apical Score <2vs. >2 


1.814 (0.898-3.664) 


0.0969 


3.893 (1.552-9.766) 


0.0038 


E-cadherin expression 






Membranous Score Ovs. >0 


0.806 (0.493-1.318) 


0.3892 


0.837(0.477-1.467) 


0.5341 


Membranous Score <1vs. >1 


1.331 (0.532-3.333) 


0.5411 


0.847 (0.263-2.731) 


0.7814 


Membranous Score <2vs. >2 


0.593 (0.082-4.284) 


0.6041 


0.913(0.125-6.646) 


0.9284 
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Table 6 

Correlation of gene expression with patient outcome (multivariate 
analysis ie looking at expression incorporating the influence of covariates) 



Multivariate analysis for univariate significant dmicopathological variables and CLDN3, Ep-CAM, SOX17, CA125. MUC1 
and E-cadherln immunoreactiv'rty with survival and relapse in 156 patients with epithelial ovarian cancer 



Disease Specific Survival 



Relapse Free Survival 



Variable 



Multivariate 
Hazards ratio (95% CI) 



Univariate 
Devalue Hazards ratio (95% CU 



p-vatue 



Pathological tumor stage 

Stage 1- 3b vs. 3c -4b 
Tumor grade 

BL and G1 Vs. G2andG3 

Age 

<50 vs.>=50 
Residual Disease 

RD<1cm vs. >=1cm 
CA125 tevei at diagnosis 

CA125<500 vs. >5Q0U/ml 
Performance Status 

PS<1 vs. >1 

CLDN3 expression 

Membranous Score 0 vs. >Q 
Membranous Score <1 vs. >1 

CA125 expression 

Membranous apical Score 0 vs. >0 
Membranous apical Score <1 vs. >1 

MUC1 expression 

Membranous apical Score 0 vs. >0 
Membranous apical Score <1 vs. >1 
Membranous apical Score <2 vs. >2 



5.66(2.467-13.012) 
4.919(2.080-11.633) 
0.951 (0.482-1.877) 
2^74(1.783-4.959) 
1.148 (0.625-2.109) 
0.286(0.136-0.601) 

1.165 (0.325-4.183) 



0.917(0.415-2.025) 
1.664 (0.976-2,837) 



0.678(0.255-1.804) 



O.0001 
0.0003 
0.8853 

OJJ001 
0.6563 
0.0009 

0.8145 



0.8302 
0.0612 



0.4361 



5.192(1.860-14.496) 
7.989(2.385-26.760) 

2.779(1.433-5.393) 
1.289 (0.659-2.520) 



0.953(0.473-1.919) 
0.693 (0.271-1.768) 



0X017 



0.0025 

0.4587 



0.8918 
0.4427 
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WE CUIM: 

1. A method of detecting an ovarian cancer-associated transcript in a biological 
sample, the method comprising contacting the biological sample with a polynucleotide 

5 that selectively hybridizes to a sequence at least 80% identical to a sequence as shown 
in Table 1 or 2 or 3. 

2. A method of diagnosing an ovarian cancer in a human or animal subject being 
tested said method comprising contacting a biological sample from said subject being 

10 tested with a nucleic acid probe for a time and under conditions sufficient for 
hybridization to occur and then detecting the hybridization wherein a modified level of 
hybridization of the probe for the subject being tested compared to the hybridization 
obtained for a control subject not having ovarian cancer indicates that the subject being 
tested has an ovarian cancer, and wherein said nucleic acid probe comprises a 
15 sequence selected from the group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a sequence 
selected from the group consisting of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 
19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 46, 48, 50, 52, 53, 55, 57, 
59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81 and 83; 
20 (ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a sequence selected from the 
group consisting of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 
29, 31, 33, 35, 37, 39, 41, 43, 45, 46, 48, 50, 52, 53, 55, 57, 59, 61, 63, 65, 67, 
69, 71 , 73, 75, 77, 79, 81 and 83; 
25 (iii) a sequence that is at least about 80% identical to a sequence selected from the 
group consisting of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 
29, 31, 33, 35, 37, 39, 41, 43, 45, 46, 48, 50, 52, 53, 55, 57, 59, 61, 63, 65, 67, 
69, 71, 73, 75, 77, 79, 81 and 83; 

(iv) a sequence that encodes an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 
34, 36, 38, 40, 42, 44, 47, 49, 51, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 
78, 80, 82 and 84; and 

(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv), 

35 
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3. A method of diagnosing an ovarian cancer in a human or animal subject being 
tested said method comprising contacting a biological sample from said subject being 
tested with a nucleic acid probe for a time and under conditions sufficient for 
hybridization to occur and then detecting the hybridization wherein a modified level of 

5 hybridization of the probe for the subject being tested compared to the hybridization 
obtained for a control subject not having ovarian cancer indicates that the subject being 
tested has an ovarian cancer, and wherein said nucleic acid probe comprises a 
sequence selected from the group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a sequence 
10 selected from the group consisting of SEQ ID NOs: 1, 5, 7, 9, 11, 13, 15, 17, 21. 

23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 45, 46, 48, 52, 53, 55, 57, 59, 61, 63, 65, 
67, 69, 71, 73, 75, 77, 79, 81 and 83; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a sequence selected from the 

15 group consisting of SEQ ID NOs: 1, 5, 7, 9, 11, 13, 15, 17, 21, 23, 25, 27, 29, 31, 

33, 35, 37, 39, 41, 45, 46, 48, 52, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 
77, 79, 81 and 83; 

(iii) a sequence that is at least about 80% identical to a sequence selected from the 
group consisting of SEQ ID NOs: 1, 5, 7, 9, 11, 13, 15, 17, 21, 23, 25, 27, 29, 31, 

20 33, 35, 37, 39, 41, 45, 46, 48, 52, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 

77, 79, 81 and 83; 

(iv) a sequence that encodes an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 2, 6, 8, 10, 12, 14, 16, 18, 22, 24, 26, 28, 30, 32, 34, 
36, 38, 40, 42, 47, 49, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82 

25 and 84; and 

(v) a sequence that is complementary to (i) or (ii) or (iii) or (iv). 

4. The method of claim 2 or 3 wherein the hybridization is enhanced in the sample 
from the subject being tested compared to the hybridization obtained for a sample from a 

30 control subject not having ovarian cancer. * 

5. The method of claim 2 or 3 wherein the hybridization is reduced in the sample 
from the subject being tested compared to the hybridization obtained for a sample from a 
control subject not having ovarian cancer. 
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6. A method of diagnosing an ovarian cancer in a human or animal subject being 
tested said method comprising contacting a biological sample from said subject being 
tested with a nucleic acid probe for a time and under conditions sufficient for 
hybridization to occur and then detecting the hybridization wherein an enhanced level of 
hybridization of the probe for the subject being tested compared to the hybridization 
obtained for a control subject not having ovarian cancer indicates that the subject being 
tested has an ovarian ovarian cancer, and wherein said nucleic acid probe comprises a 
sequence selected from the group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a nucleic 
acid set forth in Table 1 or 2 other than a nucleic acid having an Accession 
Number selected from the group consisting of NMJJ22117, NMJ305460, 
NM_002387, AI745249 and AI694200; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a nucleic acid set forth in Table 1 
or 2 other than a nucleic acid having an Accession Number selected from the 
group consisting of NMJ)22117, NMJ)05460, NM_002387, AI745249 and 
AI694200; 

(iii) a sequence that is at least about 80% identical to (i) or (ii); 

(iv) a sequence that encodes a polypeptide encoded by a nucleic acid set forth in 
Table 1 or 2 other than a nucleic acid having an Accession Number selected from 
the group consisting of NMJ)22117 f NMJ)05460, NMJ)02387, AI745249 and 
AI694200; and 

(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii)or(iii)or(iv). 

7. The method of claim 6 wherein the nucleic acid probe comprises a sequence 
selected from the group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a sequence 
selected from the group consisting of SEQ ID NOs: 7, 9, 11, 13, 15, 17, 21, 23, 
25, 27, 29, 31, 33, 35, 37, 39, 41, 45, 46, 48, 52, 53, 55, 57, 59, 61, 63, 65, 67, 
69, 71, 73, 75, 77, 79, 81 and 83; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a sequence selected from the 
group consisting of SEQ ID NOs: 7, 9, 11, 13, 15, 17, 21, 23, 25, 27, 29, 31, 33, 
35, 37, 39, 41, 45, 46, 48, 52, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 
79, 81 and 83; 
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(iii) a sequence that is at least about 80% identical to a sequence selected from the 
group consisting of SEQ ID NOs: 7, 9, 11, 13, 15, 17, 21, 23, 25, 27, 29, 31, 33, 
35, 37, 39, 41, 45, 46, 48, 52, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 
79, 81 and 83; 

5 (iv) a sequence that encodes an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 8, 10, 12, 14, 16, 18, 22, 24, 26, 28, 30, 32, 34, 36, 38, 
40, 42, 47, 49, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82 and 84; 
and 

(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
10 (ii) or (iii) or (iv). 

8. A method of diagnosing an ovarian cancer in a human or animal subject being 
tested said method comprising contacting a biological sample from said subject being 
tested with a nucleic acid probe for a time and under conditions sufficient for 

15 hybridization to occur and then detecting the hybridization wherein a reduced level of 
hybridization of the probe for the subject being tested compared to the hybridization 
obtained for a control subject not having ovarian cancer indicates that the subject being 
tested has an ovarian ovarian cancer, and wherein said nucleic acid probe comprises a 
sequence selected from the group consisting of: 

20 (i) a sequence comprising at least about 20 contiguous nucleotides from a nucleic 
acid set forth in Table 1 and having an Accession Number selected from the group 
consisting of NM JJ221 17, NM_005460, NM J302387, AI745249 and AI694200; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a nucleic acid set forth in Table 1 

25 and having an Accession Number selected from the group consisting of 

NM_022117, NMJX)5460, NM_002387, AI745249 and AI694200; 

(iii) a sequence that is at least about 80% identical to (i) or (ii); 

(iv) a sequence that encodes a polypeptide encoded by a nucleic acid set forth in 
Table 1 and having an Accession Number selected from the group consisting of 

30 NMJ)221 17, NM_005460, NMJ302387, AI745249 and AI694200; and 

(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (Iv). 



9. The method of claim 8 wherein the nucleic acid probe comprises a sequence 
35 selected from the group consisting of: 
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(i) a sequence comprising at least about 20 contiguous nucleotides from a sequence 
selected from the group consisting of SEQ ID NOs: 1 , 3, and 5; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a sequence selected from the 
group consisting of SEQ ID NOs: 1, 3, and 5; 

(iii) a sequence that is at least about 80% identical to a sequence selected from the 
group consisting of SEQ ID NOs: 1 , 3, and 5; 

(iv) a sequence that encodes an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 2, 4, and 6; and 

(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv). 

10. The method according to any one of claims 1 to 9 wherein the ovarian cancer that 
is diagnosed is an epithelial ovarian cancer. 

11. The method according to any one of claims 1 to 9 wherein the ovarian cancer that 
is diagnosed is selected from the group consisting of serous ovarian cancer, non- 
invasive ovarian cancer, mixed phenotype ovarian cancer, mucinous ovarian cancer, 
endometrioid ovarian cancer, clear cell ovarian cancer, papillary serous ovarian cancer, 
Brenner cell and undifferentiated adenocarcinoma. 

12. The method according to claim 1 1 wherein the ovarian cancer that is diagnosed is 
selected from the group consisting of serous ovarian cancer, mucinous ovarian cancer, 
endometrioid ovarian cancer and clear cell ovarian cancer. 

13. A method of diagnosing a serous ovarian cancer in a human or animal subject 
being tested said method comprising contacting a biological sample from said subject 
being tested with a nucleic acid probe for a time and under conditions sufficient for 
hybridization to occur and then detecting the hybridization wherein a modified level of 
hybridization of the probe for the subject being tested compared to the hybridization 
obtained for a control subject not having ovarian cancer indicates that the subject being 
tested has a serous ovarian cancer, and wherein said nucleic acid probe comprises a 
sequence selected from the group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a nucleic 
acid set forth in Table 2 or as set forth in Table 1 and having an Accession 
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Number selected from the group consisting of: U62801, D49441, X51630, And 
AB018305; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a nucleic acid set forth in Table 2 
or as set forth in Table 1 and having an Accession Number selected from the 
group consisting of. U62801. D49441; X51630, And AB018305; 

(iii) a sequence that is at least about 80% identical to (i) or (ii); 

(iv) a sequence that encodes a polypeptide encoded by a nucleic acid set forth in 
Table 2 or as set forth in Table 1 and having an Accession Number selected from 
the group consisting of: U62801, D49441, X51630, And AB018305; and 

(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv). 

14. A method of diagnosing a mucinous ovarian cancer in a human or animal subject 
being tested said method comprising contacting a biological sample from said subject 
being tested with a nucleic acid probe for a time and under conditions sufficient for 
hybridization to occur and then detecting the hybridization wherein an elevated level of 
hybridization of the probe for the subject being tested compared to the hybridization 
obtained for a control subject not having ovarian cancer indicates that the subject being 
tested has a mucinous ovarian cancer, and wherein said nucleic acid probe comprises a 
sequence selected from the group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a nucleic 
acid set forth in Table 1 and having an Accession Number selected from the group 
consisting of: NM_006149, AA315933, U47732, NMJ)05588, AW503395, 
NM_004063, AI073913, AI928445, NMJ)22454, W40460, AA1 32961 and 
AF1 11856; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a nucleic acid set forth in Table 1 
and having an Accession Number selected from the group consisting of: 
NM_006149, AA315933, U47732, NMJ)05588, AW503395, NM_004063, 
AI073913, AI928445, NM_022454, W40460, AA1 32961 and AF111856; 

(iii) a sequence that is at least about 80% identical to (i) or (ii); 

(iv) a sequence that encodes a polypeptide encoded by a nucleic acid set forth in 
Table 1 and having an Accession Number selected from the group consisting of: 
NM_006149, AA315933, U47732, NM_005588, AW503395, NM_004063, 
A1073913, AI928445, NM_022454, W40460, AA132961 and AF111856; and 
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(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv). 

15. The method of claim 14 wherein the nucleic acid probe comprises a sequence 
5 selected from the group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from SEQ ID NO: 
57 or 59 or 61; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from SEQ ID NO: 57 or 59 or 61; 

10 (ili) a sequence that is at least about 80% identical to SEQ ID NO: 57 or 59 or 61 ; 

(iv) a sequence that encodes the amino acid sequence set forth in SEQ ID NO: 58 or 
60 or 62; and 

(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv). 

15 

16. The method according to any one of claims 1 to 15 comprising performing a PCR 
reaction. 

17. The method according to any one of claims 1 to 16 comprising performing a 
20 nucleic acid hybridization. 

18. A method of detecting an ovarian cancer-associated polypeptide in a biological 
sample the method comprising contacting the biological sample with an antibody that 
binds specifically to an ovarian cancer-associiated polypeptide in the biological sample, 

25 the polypeptide being encoded by a polynucleotide that selectively hybridizes to a 
sequence at least 80% identical to a sequence as shown in Tables 1-3. 



19. A method of diagnosing an ovarian cancer in a human or animal subject being 
tested said method comprising contacting a biological sample from said subject being 
. 30 tested with an antibody for a time and under conditions sufficient for an antigen-antibody 
complex to form and then detecting the complex wherein a modified level of the antigen- 
antibody complex for the subject being tested compared to the amount of the antigen- 
antibody complex formed for a control subject not having ovarian cancer indicates that 
the subject being tested has an ovarian cancer, and wherein said antibody binds to a 
35 polypeptide comprising an amino acid sequence comprising at least about 10 contiguous 
amino acid residues of a sequence having at least about 80% identity to a sequence 
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selected from the group consisting of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 
24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 47, 49, 51, 54, 56, 58, 60, 62, 64, 66, 68, 70, 
72, 74, 76, 78, 80, 82 and 84. 

5 20. The method of claim 19 wherein the antibody binds to a polypeptide comprising 
an amino acid sequence comprising at least about 10 contiguous amino acid residues of 
a sequence having at least about 80% identity to a sequence selected from the group 
consisting of SEQ ID NOs: 2, 6, 8, 10, 12, 14, 16, 18, 22, 24, 26, 28, 30, 32, 34, 36, 38, 
40, 42, 47, 49, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82 and 84. 

10 

21. A method of diagnosing an ovarian cancer in a human or animal subject being 
tested said method comprising contacting a biological sample from said subject being 
tested with an antibody for a time and under conditions sufficient for an antigen-antibody 
complex to form and then detecting the complex wherein an enhanced level of the 

15 antigen-antibody complex for the subject being tested compared to the amount of the 
antigen-antibody complex formed for a. control subject not having ovarian cancer 
indicates that the subject being tested has an ovarian cancer, and wherein said antibody 
binds to a polypeptide comprising an amino acid sequence comprising at least about 10 
contiguous amino acid residues of a polypeptide encoded by a nucleic acid set forth in 

20 Table 1 or 2 other than a nucleic acid having an Accession Number selected from the 
group consisting of NMJ)221 17, NMJ)05460, NM_002387, AI745249 and AI694200. 

22. The . method of claim 21 wherein the antibody binds to a polypeptide comprising 
an amino acid sequence comprising at least about 10 contiguous amino acid residues of 

25 a sequence having at least about 80% identity to a sequence selected from the group 
consisting of SEQ ID NOs: 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 
40, 42, 44, 47, 49, 51, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82 and 84. 

23. A method of diagnosing an ovarian cancer in a human or animal subject being 
30 tested said method comprising contacting a biological sample from said subject being 

tested with an antibody for a time and under conditions sufficient for an antigen-antibody 
complex to form and then detecting the complex wherein a reduced level of the antigen- 
antibody complex for the subject being tested compared to the amount of the antigen- 
antibody complex formed for a control subject not having ovarian cancer indicates that 
35 the subject being tested has an ovarian cancer, and wherein said antibody binds to a 
polypeptide comprising an amino acid sequence comprising at least about 10 contiguous 
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amino acid residues of a polypeptide encoded by a nucleic acid set forth in Table 1 and 
having an Accession Number selected from the group consisting of NM_022117, 
NM_005460, NM_002387, AI745249 and AI694200. 

24. The method of claim 23 wherein the antibody binds to a polypeptide comprising 
an amino acid sequence comprising at least about 10 contiguous amino acid residues of 
a sequence having at least about 80% identity to a sequence selected from the group 
consisting of SEQ ID NOs: 2, 4, and 6. 

25. The method according to any one of claims 19 to 24 wherein the ovarian cancer 
that is diagnosed is an epithelial ovarian cancer. 

26. The method according to any one of claims 19 to 24 wherein the ovarian cancer 
that is diagnosed is selected from the group consisting of serous ovarian cancer, non- 
invasive ovarian cancer, mixed phenotype ovarian cancer, mucinous ovarian cancer, 
endometrioid ovarian cancer, clear cell ovarian cancer, papillary serous ovarian cancer, 
Brenner cell and undifferentiated adenocarcinoma. 

27. The method according to claim 26 wherein the ovarian cancer that is diagnosed is 
selected from the group consisting of serous ovarian cancer, mucinous ovarian cancer, 
endometrioid ovarian cancer and clear cell ovarian cancer. 

28. A method of diagnosing a serous ovarian cancer in a human or animal subject 
being tested said method comprising contacting a biological sample from said subject 
being tested with an antibody for a. time and under conditions sufficient for an antigen- 
antibody complex to form and then detecting the complex wherein a modified level of the. 
antigen-antibody complex for the subject being tested compared to the amount of the 
antigen-antibody complex formed for a control subject not having ovarian cancer 
indicates that the subject being tested has a serous ovarian cancer, and wherein said 
antibody binds to a polypeptide comprising an amino acid sequence comprising at least 
about 10 contiguous amino acid residues of a polypeptide encoded by a nucleic acid set 
forth in Table 2 or as set forth in Table 1 and having an Accession Number selected from 
the group consisting of: U62801, D49441, X51630, And AB018305. 

29. A method of diagnosing a mucinous ovarian cancer in a human or animal subject 
being tested said method comprising contacting a biological sample from said subject 
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being tested with an antibody for a time and under conditions sufficient for an antigen- 
antibody complex to form and then detecting the complex wherein a reduced level of the 
antigen-antibody complex for the subject being tested compared to the amount of the 
antigen-antibody complex formed for a control subject not having ovarian cancer 
5 indicates that the subject being tested has a mucinous ovarian cancer, and wherein said 
antibody binds to a polypeptide comprising an amino acid sequence comprising at least 
about 10 contiguous amino acid residues of a polypeptide encoded by a nucleic acid set 
forth in Table 1 and having an Accession Number selected from the group consisting of: 
NM_006149, AA315933, U47732, NMJ)05588, AW503395, NM_004063, AI073913, 
10 AI928445, NM_022454, W40460, AA1 32961 and AF1 1 1 856. 

30. The method according to claim 29 wherein the antibody binds to a polypeptide 
comprising an amino acid sequence comprising at least about 10 contiguous amino acid 
residues of a sequence having at least about 80% identity to SEQ ID NO: 58 or 60 or 62. 

15 

31. A method of detecting an ovarian cancer-associated antibody in a biological 
sample the method comprising contacting the biological sample with a polypeptide 
encoded by a polynucleotide that selectively hybridizes to a sequence at least 80% 
identical to a sequence as shown in Tables 1-3, wherein the polypeptide specifically 

20 binds to the ovarian cancer-associated antibody. 

32. The method according to any one of claims 1 to 31 wherein the biological sample 
is contacted with a plurality of nucleic acid probes and/or antibodies and/or polypeptides. 

25 33. The method according to any one of claims 1 to 32 wherein the subject being 
tested is a patient undergoing a therapeutic regimen to treat ovarian cancer. 

34. The method according to any one of claims 1 to 32 wherein the subject being 
tested is a subject suspected of having ovarian cancer. 

30 

35. A method of monitoring the efficacy of a therapeutic treatment of ovarian cancer, 
the method comprising: 

(i) providing a biological sample from a patient undergoing the therapeutic 
treatment; and 

35 (ii) determining the level of a ovarian cancer-associated transcript in the 

biological sample by contacting the biological sample with a polynucleotide 
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that selectively hybridizes to a sequence having at least about 80% identity 
to a sequence as shown, in any one of Tables 1-3, thereby monitoring the 
efficacy of the therapy. 

5 36. The method according to claim 35 further comprising comparing the level of the 
ovarian cancer-associated transcript to a level of the ovarian cancer-associated transcript 
in a biological sample from the patient prior to, or earlier in, the therapeutic treatment 

37. A method of monitoring the efficacy of a therapeutic treatment of ovarian cancer, 
10 the method comprising : 

(i) providing a biological sample from a patient undergoing the therapeutic 
treatment; and 

(ii) determining the level of a ovarian cancer-associated antibody in the 
biological sample by contacting the biological sample with a polypeptide 

15 encoded by a polynucleotide that selectively hybridizes to a sequence at 

least 80% identical to a sequence as shown in Tables 1-3, wherein the 
polypeptide specifically binds to the ovarian cancer-associated antibody, 
thereby monitoring the efficacy of the therapy. 

20 38. The method of claim 37 further comprising comparing the level of the ovarian 
cancer-associated antibody to a level of the ovarian cancer-associated antibody in a 
biological sample from the patient prior to, or earlier in, the therapeutic treatment 

39. A method of monitoring the efficacy of a therapeutic treatment of ovarian cancer, 
25 the method comprising : 

(i) providing a biological sample from a patient undergoing the therapeutic 
treatment; and 

(ii) determining the level of a ovarian cancer-associated polypeptide in the 
biological sample by contacting the biological sample with an antibody, wherein 

30 the antibody specifically binds to a polypeptide encoded by a polynucleotide that 

selectively hybridizes to a sequence at least 80% identical to a sequence as 
shown in Tables 1-3, thereby monitoring the efficacy of the therapy. 
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40. The method of claim 39 further comprising comparing the level of the ovarian 
cancer-associated polypeptide to a level of the ovarian cancer-associated polypeptide in 
a biological sample from the patient prior to, or earlier in, the therapeutic treatment 

5 41. The method according to any one of claims 35 to 40 wherein the ovarian cancer 
that is diagnosed is an epithelial ovarian cancer. 

42. The method according to any one of claims 35 to 41 wherein the ovarian cancer 
that is diagnosed is selected from the group consisting of serous ovarian cancer, non- 
10 invasive ovarian cancer, mixed phenotype ovarian cancer, mucinous ovarian cancer, 

endometrioid ovarian cancer, clear cell ovarian cancer, papillary serous ovarian cancer, 
Brenner cell and undifferentiated adenocarcinoma. 

43. The method according to claim 42 wherein the ovarian cancer that is diagnosed is 
15 selected from the group consisting of serous ovarian cancer, mucinous ovarian cancer, 

endometrioid ovarian cancer and clear cell ovarian cancer. 

44. A method of determining the likelihood of survival of a subject suffering from an 
ovarian cancer, said method comprising contacting a biological sample from said subject 

20 being tested with a nucleic acid probe for a time and under conditions sufficient for 
hybridization to occur and then detecting the hybridization wherein an elevated level of 
hybridization of the probe for the subject being tested compared to the hybridization 
obtained for a control subject not having ovarian cancer indicates that the subject being 
tested has a poor probability of survival, and wherein said nucleic acid probe comprises a 
25 sequence selected from the group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a nucleic 
acid set forth in Table 1 and having an Accession Number selected from the group 
consisting of: NMJX>3014, AA046217, NMJ)159D2, T83882, AB040888, 
AA628980, AI623351, AW6T4420, AA243499, AF251237, AI970797, AF145713, 
30 X78565, T97307, BE243845, AW068302, AL1 33561, BE313555, X07820, 

AI973016, AF084545, U41518, Z11894, AW138190, BE086548, W47196, 
AI796870, X02761, AW968613, AW972565, AF045229, AW953853, U52426. 
F06700, AI798863, H52761, BE546947, AU076643, U20536, AA581602, 
AJ245210, X65965, AI806770, BE386490, AW581992, U77534, AL034417, 
35 L10343, AW518944, W28729, AI640160, U11862, AW295980, X59135, 

BE466173, AI354722, M90464, AA829286, AI333771, BE465867, NMJ)14992, 
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BE616902, AA430373, R27430, BE387335, AW264102, AW952323, AA088177, 
BE614567, AL079658, NM_002776, BE261944, NM_006379, AI002238, X81789, 
NMJJ02122, AB001914, AA311919, AI381750, AA292998, BE439580, AI677897, 
N72403, BE003054, ALQ35588, AI080491, AW770994, H24177, AF146761, 
5 NM_001955, AI680737, AI752666, AA505445, BE246649, and NM_003955; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a nucleic acid set forth in Table 1 
and having an Accession Number selected from the group consisting of: 
NM_003014, AA046217, NM_015902, T83882, AB040888, AA628980, AI623351, 

10 AW614420, AA243499, AF251237, AI970797, AF145713, X78565, T97307, 

BE243845, AW068302, AL133561, BE313555, X07820, AI973016, AF084545, 
U41518, Z11894, AW138190, BE086548, W47196, AI796870, X02761, 
AW968613, AW972565, AF045229, AW953853, U52426, F06700, AI798863. 
H52761, BE546947, AU076643, U20536, AA581602, AJ245210, X65965. 

15 AI806770, BE386490, AW581992, U77534, AL034417, L10343, AW518944. 

W28729, AI640160, U11862, AW295980, X59135, BE466173, AI354722, 
M90464, AA829286, AI333771, BE465867, NM_014992, BE616902, AA430373, 
R27430, BE387335, AW264102, AW952323, AA088177, BE614567, AL079658, 
NM_002776, BE261944, NM_006379, AI002238, X81789, NM_002122, 

20 AB001914, AA311919, AI381750, AA292998, BE439580, AI677897, N72403, 

BE003054, AL035588. AI080491, AW770994, H24177, AF146761, NM_001955, 
AI680737, AI752666, AA505445, BE246649, and NM_003955; 

(iii) a sequence that is at least about 80% identical to (i) or (ii); 

(iv) a sequence that encodes a polypeptide encoded by a nucleic acid set forth in 
25 Table 1 and having an Accession Number selected from the group consisting of: 

NM_003014, AA046217, NM_015902, T83882, AB040888, AA628980, AI623351, 
AW614420, AA243499, AF251237, AI970797, AF145713, X78565, T97307, 
BE243845, AW068302, AL133561, BE313555, X07820, AI973016, AF084545, 
U41518, Z11894, AW138190, BE086548, W47196, AI796870, X02761, 

30 AW968613, AW972565, AF045229, AW953853, U52426, F06700, AI798863, 

H52761, BE546947, AU076643, U20536, AA581602, AJ245210, X65965. 
AI806770, BE386490, AW581992, U77534, AL034417, L10343, AW518944. 
W28729, AI640160, U11862, AW295980, X59135, BE466173, AI354722, 
M90464, AA829286, AI333771, BE465867, NM_014992, BE616902, AA430373, 

35 R27430, BE387335, AW264102, AW952323, AA088177, BE614567, AL079658, 

NM_002776, BE261944, NM_006379, AI002238, X81789, NM_002122, 
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AB001914, AA311919, AI381750, AA292998, BE439580, AI677897, N72403, 
BE003054, AL035588, AI080491, AW770994, H24177, AF146761, NMJXM955, 
AI680737, AI752666, AA505445, BE246649, and NM J)03955; and 
(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv), 

45. The method of claim 44 wherein the nucleic acid probe comprises a sequence 
selected from the group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a sequence 
selected from the group consisting of SEQ ID NOs: 63, 65, 67, 69, 71 , and 73; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a sequence selected from the 
group consisting of SEQ ID NOs: 63, 65, 67, 69, 71, 73, 75, 77, 79, 81 and 83; 

(iii) a sequence that is at least about 80% identical to a sequence selected from the 
group consisting of SEQ ID NOs: 63, 65, 67, 69, 71, 73, 75, 77, 79, 81 and 83; 

(iv) a sequence that encodes an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 64, 66, 68, 70, 72, 74, 76, 78, 80, 82 and 84; and 

(v) a sequence that is complementary to (i) or (ii) or (iii) or (iv). 

46. A method of determining the likelihood of survival of a subject suffering from an 
ovarian cancer, said method comprising contacting a biological sample from said subject 
being tested with an antibody for a time and under conditions sufficient for an antigen- 
antibody complex to form and then detecting the complex wherein an enhanced level of 
the antigen-antibody complex for the subject being tested compared to the amount of the 
antigen-antibody complex formed for a control subject not having ovarian cancer 
indicates that the subject being tested has has a poor probability of survival, and wherein 
said antibody binds to a polypeptide comprising an amino acid sequence comprising at 
least about 10 contiguous amino acid residues of a sequence encoded by a nucleic acid 
set forth in Table 1 and having an Accession Number selected from the group consisting 
of: NM_003014, AA046217, NM_015902, T83882, AB040888, AA628980, AI623351, 
AW614420, AA243499, AF251237, AI970797, AF145713, X78565, T97307, BE243845, 
AW068302, AL133561, BE313555, X0782Q, AI973016, AF084545, U41518, Z11894, 
AW138190, BE086548, W47196, AI796870, X02761, AW968613, AW972565, 
AF045229, AW953853, U52426, F06700, AI798863, H52761, BE546947, AU076643, 
U20536, AA581602, AJ245210, X65965, AI806770, BE386490, AW581992, U77534, 
AL034417, L10343, AW518944, W28729, AI640160, U11862, AW295980, X59135, 
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BE466173, AI354722, M90464, AA829286, AI333771, BE465867, NM_014992, 
BE616902, AA430373, R27430, BE387335, AW264102, AW952323, AA088177, 
BE614567, AL079658, NMJ302776, BE261944, NM_006379, AI002238, X81789, 
NMJ)02122, AB001914, AA311919, AI381750, AA292998, BE439580, AI677897, 
N72403, BE003054, AL035588, AI080491, AW770994, H24177, AF146761, 
NM_001955, AI680737, AI752666, AA505445, BE246649, and NM_003955. 

47. The method of claim 46 wherein the antibody binds to a polypeptide comprising 
an amino acid sequence comprising at least about 10 contiguous amino acid residues of 
a sequence having at least about 80% identity to a sequence selected from the group 
consisting of SEQ ID NOs: 64, 66, 68, 70, 72, 74, 76, 78, 80, 82 and 84. 

48. A method of determining the likelihood of survival of a subject suffering from a 
serous ovarian cancer, said method comprising contacting a biological sample from said 
subject being tested with a nucleic acid probe for a time and under conditions sufficient 
for hybridization to occur and then detecting the hybridization wherein an elevated level 
of hybridization of the probe for the subject being tested compared to the hybridization 
obtained for a control subject not having ovarian cancer indicates that the subject being 
tested has a poor probability of survival, and wherein said nucleic acid probe comprises a 
sequence selected from the group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a nucleic 
acid comprising the nucleotide sequence set forth in SEQ ID NO: 71 or 73; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a nucleic acid comprising the 
nucleotide sequence set forth in SEQ ID NO: 71 or 73; 

(iii) a sequence that is at least about 80% identical to (i) or (ii) and encoding an sFRP 
protein or a SOCS3 protein; 

(iv) a sequence that encodes a polypeptide comprising the amino acid sequence set 
forth in SEQ ID NO: 72 or 74; and 

(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv). 

49. A method of determining the likelihood of survival of a subject suffering from a 
serous ovarian cancer, said method comprising contacting a biological sample from said 
subject being tested with an antibody for a time and under conditions sufficient for an 
antigen-antibody complex to form and then detecting the complex wherein an enhanced 
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level of the antigen-antibody complex for the subject being tested compared to the 
amount of the antigen-antibody complex formed for a control subject not having ovarian 
cancer indicates that the subject being tested has a poor probability of survival, and 
wherein said antibody binds to an sFRP polypeptide comprising the amino acid 
5 sequence set forth in SEQ ID NO: 72 or a SOCS3 polypeptide comprising the amino acid 
sequence set forth in SEQ ID NO: 74. 

50. A method of determining the likelihood of survival of a subject suffering from a 
serous ovarian cancer, said method comprising contacting a biological sample from said 

10 subject being tested with at least two antibodies for a time and under conditions sufficient 
for antigen-antibody complexes to form and then detecting the complexes wherein an 
enhanced level of the antigen-antibody complexes for the subject being tested compared 
to the amount of the antigen-antibody complexes formed for a control subject not having 
ovarian cancer indicates that the subject being tested has a poor probability of survival, 

15 and wherein one antibody binds to an sFRP polypeptide comprising the amino acid 
sequence set forth in SEQ ID NO: 72 and wherein one antibody binds to a SOCS3 
polypeptide comprising the amino acid sequence set forth in SEQ ID NO: 74. 

51. The method according to any one of claims 44 to 47 wherein the ovarian cancer 
20 is an epithelial ovarian cancer. 

52. The method according to any one of claims 44 to 47 wherein the ovarian cancer is 
selected from the group consisting of serous ovarian cancer, non-invasive ovarian 
cancer, mixed phenotype ovarian cancer, mucinous ovarian cancer, endometrioid 

25 ovarian cancer, clear cell ovarian cancer, papillary serous ovarian cancer, Brenner cell 
and undifferentiated adenocarcinoma. 

53. The method according to claim 52 wherein the ovarian cancer is selected from the 
group consisting of serous ovarian cancer, mucinous ovarian cancer, endometrioid 

30 ovarian cancer and clear cell ovarian cancer. 

54. A method of determining the likelihood that a subject will suffer from a recurrence 
of an ovarian cancer, said method comprising contacting a biological sample from said 
subject being tested with a nucleic acid probe for a time and under conditions sufficient 

35 for hybridization to occur and then detecting the hybridization wherein an elevated level 
of hybridization of the probe for the subject being tested compared to the hybridization 
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obtained for a control subject not having ovarian cancer indicates that the subject being 
tested has a high probability of recurrence, and wherein said nucleic acid probe 
comprises a sequence selected from the group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a nucleic 
acid set forth in Table 1 and having an Accession Number selected from the group 
consisting of: M86849, AW963419, BE298665, AK000637, BE077546, T97307, 
R24601, BE090176, AA393907, W28729, BE313754, AW673081, AA356694, 
L08239, BE397649, NM_012317, NM_000947, AJ250562, AL040183, BE207573, 
BE564162, BE439580, AW067800, AA569756, AW138190, AF126245, L10343, 
NM_002514, AI863735, NM_005397, W26391, H15474, U51166, AA243499, 
AW408807, AI738719, AB040888, BE313077, AI677897, C14898, AI821730, 
AF007393, H65423, N46243, AA095971, U20350, NM_005756, D19589. 
AW957446, AW294647, BE159718, AI888490, AA022569, BE147740, AI798863, 
BE464341, AL080235, AI557212, X75208, AA628980, BE242587, NM_005512, 
AW953853, AU076611, AW968613, AL353944, BE614149, AA292998, H12912, 
AA188763, AK000596, AI970797, AW519204, Z42387, AF145713, AA972412, 
AK001564, AW959861, BE313555, W25005, AI193356, AF111106, AI130740, 
AA985190, BE221880, AF084545, R26584, AW247380, AA364261, U25849, 
AF262992, AW342140, AL133572, AI497778, AI745379, U51712, AW375974, 
AF251237, NM_000636, AA130986, AA216363, AA628980, AA811657, 
AA897108, AB040888, AF212225, AI089575, AI282028, A1368826, AI718702, 
AI827248, AK002039, AL109791, AW090198, AW296454, AW445034, 
AW452948, AW470411, AW885727, AW970859, AW979189, BE165866, 
BE1 75582, BE242587, BE271927, BE439580, BE464016, D63216, F34856, 
M83822, N33937, N49068, N51357, N80486, NM_000954, NM_005756, 
NM_016652, R26584, R31178, W05391, W25005, W45393, W68815, X65965, 
X76732andZ45051, 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a nucleic acid set forth in Table 1 
and having an Accession Number selected from the group consisting of: M86849, 
AW963419, BE298665, AK000637, BE077546, T97307, R24601, BE090176, 
AA393907, W28729, BE313754, AW673081, AA356694, L08239, BE397649, 
NM_012317, NM_000947, AJ250562, AL040183, BE207573, BE564162, 
BE439580, AW067800, AA569756, AW138190, AF126245, L10343, NM_002514, 
AI863735, NM_005397, W26391, H15474, U51166, AA243499, AW408807, 
AI738719, AB040888, BE313077, AI677897, C14898, AI821730, AF007393, 
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H65423, N46243, AA095971, U20350, NM_005756. D19589, AW957446. 
AW294647, BE159718, AI888490, AA022569, BE1 47740, AI798863, BE464341, 
AL080235, AI557212, X75208, AA628980, BE242587, NM_005512, AW953853, 
AU076611, AW968613..AL353944, BE614149, AA292998, H12912, AA188763, 
5 AK000596, AI970797, AW519204, Z42387, AF145713, M972412, AK001564, 

AW959861, BE313555, W25005, AI193356, AF111106, AI130740, AA985190, 
BE221880, AF084545, R26584, AW247380, AA364261, U25849, AF262992, 
AW342140, AL133572, AI497778, AI745379, U51712, AW375974, AF251237, 
NM_000636, AA130986, AA216363. AA628980, AA811657, AA897108, 

10 AB040888, AF212225. AI089575, AI282028, AI368826. AI718702, AI827248, 

-AK002039, AL109791, AW090198, AW296454, AW445034, AW452948, 
■AW470411, AW885727, AW970859, AW979189. BE1 65866, BE175582, 
BE242587, BE271927, BE439580, BE464016, D63216, F34856, M83822, 
N33937, N49068, N51357, N80486, NM_000954, NM_005756, NM_016652, 

15 R26584, R31178, W05391, W25005^ W45393, W68815, X65965, X76732 and 

Z45051; 

(iii) a sequence that is at least about 80% identical to (i) or (ii); 

(iv) a sequence that encodes a polypeptide encoded by a nucleic acid set forth in 
Table 1 and having an Accession Number selected from the group consisting 

20 of:M86849, AW963419, BE298665, AK000637, BE077546, T97307, R24601, 

BE090176, AA393907, W28729, BE313754, AW673081, AA356694, L08239, 
BE397649, NM_012317, NM_000947, AJ250562, AL040183, BE207573, 
BE564162, BE439580, AW067800, AA569756, AW138190, AF126245, L10343, 
NM_002514, AI863735, NM_005397, W26391, H15474, U51166, AA243499, 

25 AW408807, AI738719. AB040888, BE313077, AI677897, C14898, AJ821730, 

AF007393, H65423, N46243, AA095971, U20350, NM_005756, D19589, 
AW957446, AW294647, BE159718, AI888490, AA022569, BE147740, AI798863, 
BE464341. AL080235. AI557212, X75208, AA628980, BE242587, NM_005512, 
AW953853, AU076611, AW968613, AL353944, BE614149, AA292998. H12912, 

30 AA188763, AK000596, AI970797, AW519204, Z42387, AF145713, AA972412, 

AK001564, AW959861, BE313555, W25005, AI193356, AF111106, AI130740, 
AA985190, BE221880, AF084545, R26584, AW247380, AA364261, U25849, 
AF262992, AW342140, AL133572, AI497778, AI745379, U51712, AW375974, 
AF251237, NM_000636, AA130986, AA216363, AA628980, AA811657, 

35 AA897108, AB040888, AF212225, AI089575, AI282028, AI368826, AI718702, 

AI827248, AK002039, AL109791, AW090198, AW296454, AW445034, 
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AW452948, AW470411, AW885727, AW970859, AW979189, BE165866, 
BE175582, BE242587, BE271927, BE439580, BE464016, D63216, F34856, 
M83822, N33937, N49068, N51357, N80486, NM_000954, NM_005756, 
NM_016652, R26584, R31178, W05391, W25005, W45393, W68815, X65965, 
5 X76732andZ45051;and 

(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv). 

55. The method of claim 54 determining the likelihood that a subject will suffer from a 

10 recurrence of an ovarian cancer, said method comprising contacting a biological sample 
from said subject being tested with an antibody for a time and under conditions sufficient 
for an antigen-antibody complex to form and then detecting the complex wherein an 
enhanced level of the antigen-antibody complex for the subject being tested compared to 
the amount of the antigen-antibody complex formed for a control subject not having 

15 ovarian cancer indicates that the subject being tested has a high probability of 
recurrence, and wherein said antibody binds to a polypeptide comprising an amino acid 
sequence comprising at least about 10 contiguous amino acid residues of a sequence 
encoded by a nucleic acid set forth in Table 1 and having an Accession Number selected 
from the group consisting of: M86849, AW963419, BE298665, AK000637, BE077546, 

20 T97307, R24601, BE090176, AA393907, W28729, BE313754, AW673081, AA356694, 
L08239, BE397649, NM_012317, NM_000947, AJ250562. AL040183, BE207573, 
BE564162, BE439580, AW067800. AA569756, AW138190. AF126245, L10343, 
NM_002514, AI863735, NM_005397, W26391, H15474, U51166, AA243499, 
AW408807, AI738719, AB040888, BE313077, AI677897, C14898, AI821730, 

25 AF007393, H65423, N46243, AA095971, U20350, NM_005756, D19589, AW957446, 
AW294647, BE159718, AI888490, AA022569, BE147740, AI798863, BE464341, 
AL080235, AI557212, X75208, AA628980, BE242587, NM_005512, AW953853, 
AU076611, AW968613, AL353944, BE614149, AA292998, H12912, AA188763, 
AK000596, AI970797, AW519204, Z42387, AF145713, AA972412, AK001564, 

30 AW959861, BE313555, W25005, AI193356, AF111106, AI130740, AA985190, 
BE221880, AF084545, R26584, AW247380, AA364261, U25849, AF262992, 
AW342140, AL133572, AI497778, AI745379, U51712, AW375974, AF251237, 
NM_000636, AA130986, AA216363, AA628980, AA811657, AA897108, AB040888, 
AF212225, AI089575, AI282028, AI368826, AI718702, AI827248, AK002039, 

35 AL1 09791, AW090198, AW296454, AW445034, AW452948, AW470411, AW885727, 
AW970859, AW979189, BE1 65866, BE1 75582, BE242587, BE271927, BE439580, 
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BE464016, D63216, F34856, M83822, N33937, N49068, N51357, N80486, NM__000954, 
NMJ)05756, NM_016652, R26584, R31178, W05391, W25005, W45393, W68815, 
X65965,X76732andZ45051. . 

5 56. The method according to claim 54 or 55 wherein the ovarian cancer is an epithelial 
ovarian cancer. 

57. The method according to any one of claims 54 to 56 wherein the ovarian cancer is 
selected from the group consisting of serous ovarian cancer, non-invasive ovarian 

10 cancer, mixed phenotype ovarian cancer, mucinous ovarian cancer, endometrioid 
ovarian cancer, clear cell ovarian cancer, papillary serous ovarian cancer, Brenner cell 
and undifferentiated adenocarcinoma. 

58. The method according to claim 57 wherein the ovarian cancer is selected from the 
15 group consisting of serous ovarian cancer, mucinous ovarian cancer, endometrioid 

ovarian cancer and clear cell ovarian cancer. 

59. The method according to any one of claims 35 to 58 wherein the biological sample 
is contacted with a plurality of nucleic acid probes and/or antibodies and/or polypeptides. 

20 

60. A method for identifying a compound that modulates an ovarian cancer- 
associated polypeptide, the method comprising : 

(i) contacting the compound with a ovarian cancer-associated polypeptide, the 
polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence 

25 wr at least 80% identical to a sequence as shown in Tables 1-3; and 

(ii) determining the functional effect of the compound upon the polypeptide. 



61. A method for determining a candidate compound for the treatment of ovarian 
cancer comprising : 

30 (0 administering a test compound to a mammal having ovarian cancer or a cell 

isolated therefrom; 

(ii) comparing the level of gene expression of a polynucleotide that selectively 
hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 
1-3 in a treated cell or mammal with the level of gene expression of the 
35 polynucleotide in a control cell or mammal, wherein a test compound that 
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modulates the level of expression of the polynucleotide is a candidate for the 
treatment of ovarian cancer. 

62. An assay device for use in the diagnosis or prognosis of ovarian cancer, said 
5 device comprising a plurality of polynucleotides immobilized to a solid phase, wherein 

each of said polnucleotides consists of a gene as listed in any one of Tables 1-3. 

63. The device of claim 62 consisting of a substantially planar chip. 

10 64. An assay device for use in the diagnosis or prognosis of ovarian cancer, said 
device comprising a plurality of different antibodies immobilized to a solid phase, wherein 
each of said antibodies binds to a polypeptide listed in Tables 1-3. 

65. The device of claim 64 consisting of a substantially planar chip. 

66. Use of a polynucleotide as set forth in any one of Tables 1-3 in the diagnosis or 
prognosis of ovarian cancer or for the preparation of a medicament for the treatment of 
ovarian cancer. 

67. Use of a vector comprising a polynucleotide as set forth in any one of Tables 1-3 
in the diagnosis or prognosis of ovarian cancer or for the preparation of a medicament for 
the treatment of ovarian cancer. 

68. Use of an isolated polypeptide as set forth in any one of Tables 1-3 in the 
diagnosis or prognosis of ovarian cancer or for the preparation of a medicament for the 
treatment of ovarian cancer. 

69. Use of an antibody that binds to ah isolated polypeptide as set forth in any one of 
Tables 1-3 in the diagnosis or prognosis of ovarian cancer or for the preparation of a 
medicament for the treatment of ovarian cancer. 

70. A method of diagnosing an ovarian cancer in a human or animal subject being 
tested said method comprising determining aberrant methylation in a promoter sequence 
that regulates expression of a tumor suppressor gene in a biological sample from said 
subject compared to the methylation of the promoter in nucleic acid obtained for a control 
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subject not having ovarian cancer wherein said aberrant methylation indicates that the 
subject being tested has an ovarian ovarian cancer. 

71. The method of claim 70 wherein hypermethylation of the promoter sequence is 
5 determined. 

72. The method of claim 70 or 71 wherein the methylation is determined in the 
promoter region that regulates expression of an MCC gene comprising a sequence 
selected from the group consisting oft 

10 (i) the nucleotide sequence set forth as SEQ ID NO: 3; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
v to the nucleotide sequence set forth as SEQ ID NO: 3; 

(iii) a sequence that is at least about 80% identical to (i) or (ii); 

(iv) a sequence that encodes a polypeptide encoded by a nucleotide sequence set 
15 forth as SEQ ID NO: 3; and 

(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or(iv). 



73. The method according to any one of claims 70 to 72 wherein the ovarian cancer 
20 that is diagnosed is an epithelial ovarian cancer. 
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SEQUENCE LISTING j 

<110> Garvan InSstitute of Medical Research • 

<120> Methods for the diagnosis and prognosis of ovarian cancer j 

<130> 501731/MRO 

<150> AU2002951346 
<151> 2002-09-05 

<160> 84 

<170> Patentln version 3.1 

<210> 1 

<211> 3432 

<212> DHA 

<213> NMJ)05460 SNCAIP, synphilin 

<220> 
<221> CDS 
<222> (94) . . (2850) 
<223> 

<400> 1 

gtcagtccct tcgcgctcct gagccgccgg cgcgccgggc gccccgggaa tttataagta 60 

tttgaccgta ctcaaaatgt gcaaggaaga ata atg gaa gcc cct gaa tac ctt 114 

Met Glu Ala Pro Glu Tyr Leu 

1*5 

gat ttg gat gaa att gac ttt agt gat gac ata tct tat tea gtc aca 162 
Asp Leu Asp Glu He Asp Phe Ser Asp Asp He Ser Tyr Ser Val Thr 
10 15 20 

tea etc aag acg ate cca gaa ctg tgc cga aga tgt gat acg caa aac 210 
Ser Leu Lys Thr He Pro Glu Leu Cys Arg Arg Cys Asp Thr Gin Asn 
25 30 35 

gaa gac aga tea get tct age tct age tgg aat tgt ggc ate tea act 258 
Glu Asp Arg Ser Ala Ser Ser Ser Ser Trp Asn Cys Gly He Ser Thr 
40 45 50 ~ 55 

ctt att aca aac acg caa aag ccc aca gga ate get gat gtg tac agt 306 
Leu He Thr Asn Thr Gin Lys Pro Thr Gly He Ala Asp Val Tyr Ser 
60 65 70 

aag ttc cgc cca gtg aag egg gtt teg cca ctg aaa cat cag cca gag 354. 
Lys Phe Arg Pro Val Lys Arg Val Ser Pro Leu Lys His Gin Pro Glu 
75 80 85 

act ctg gag aac aat gaa agt gat gac caa aag aac cag aaa gtg gtt 402 
Thr Leu Glu Asn Asn Glu Ser Asp Asp Gin Lys Asn Gin Lys Val Val 
90 95 100 

gag tac cag aaa ggg ggt gag tct gac ctg ggc ccc cag cct cag gag 450 
Glu Tyr Gin Lys Gly Gly Glu Ser Asp Leu Gly Pro Gin Pro Gin Glu 
105 HO H5 

ctt ggc cct gga gat gga gtg ggc ggc cca cca ggt aag age tct gag 498 
Leu Gly Pro Gly Asp Gly Val Gly Gly Pro Pro Gly Lys Ser Ser Glu 
120 125 130 135 

ccc age aca teg ctg ggt gaa ctg gag cac tac gac etc gac atg gat 546 
Pro Ser Thr Ser Leu Gly Glu Leu Glu His Tyr Asp Leu Asp Met Asp 
140 145 150 
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gag att ctg gat gtg cct tat att aaa -tec agt cag cag ctt gec tct 594 
Glu lie Leu Asp Val Pro Tyr He Lys Ser Ser Gin Gin Leu Ala Ser 
155 160. 165 

I? 1? f 39 gt ? 3Ct tCa gaa aaa aga att "9 99<= tta tgc aca acc 642 
Phe Thr Lys Val Thr Ser Glu Lys Arg He Leu Gly Leu Cys Thr Thr 
170 175 180 

ate aat ggc ctt tct ggc aaa gec tgc tct aca gga agt tct gag age 690 
He Asn Gly Leu Ser Gly Lys Ala Cys Ser Thr Gly Ser Ser Glu Ser 
185 190 19S 

tea tea tec aac atg gca cca ttt tgt gtt ctt tct ccc gtg aaa age 738 
Ser Ser. Ser Asn Met Ala Pro Phe Cys Val Leu Ser Pro Val Lys Ser 
200 205 210 215 

cct cac ttg aga aaa gca tea get gtc ate cac gac cag cac aag ctg 786 
Pro His Leu Arg Lys Ala Ser Ala Val He His Asp Gin His Lys Leu 
220 225 230 

tec act gaa gaa acc gag ate tea cct cct ctg gtt aaa tgt ggc tct 834 
Ser Thr Glu Glu Thr Glu He Ser Pro Pro Leu Val Lys Cys Gly Ser 
235 240 245 

gca tat gag cct gaa aac cag agt aaa gac ttc eta aac aag aca ttt 882 
Ala Tyr Glu Pro Glu Asn Gin Ser Lys Asp Phe Leu Asn Lys Thr Phe 
250 255 260 

agt gat cct cat ggt cga aaa gtt gag aag aca aca cca gac tgc cag 930 
Ser Asp Pro His Gly Arg Lys Val Glu Lys Thr Thr Pro Asp Cys Gin 
265 270 275 

it?, 359 f?° tt° CaC ° ta Caa toa gca 9^ gaa tec aaa cca gaa 978 
Leu Arg Ala Phe His Leu Gin Ser Ser Ala Ala Glu Ser Lys Pro Glu 
280 285 290 2 95 

111 If r?° r ta C " aCC ag ° tCC Caa ggc coa gaa * aa !°26 

Glu Gin Val Ser Gly Leu Asn Arg Thr Ser Ser Gin Gly Pro Glu Glu 

300 305 310 

agg agt gag tat ctg aaa aaa gtg aaa age ate ttg aac att gtt aaa 1074 
Arg Ser Glu Tyr Leu Lys Lys Val Lys Ser He Leu Asn He Val Lys 
315 320 325 

? aa ?? a ^f g a ^° tCt Ct ° Ctg cca cac cta gct 9° a 9 ao a *t eta gac 1122 
Glu Gly Gin He Ser Leu Leu Pro His Leu Ala Ala Asp Asn Leu Asp 

330 335 340 

aaa att cac gac gaa aat gga aac aat cta tta cat att gcg gcg tea 1170 
Lys He His Asp Glu Asn Gly Asn Asn Leu Leu His He Ala Ala Ser 
34 5 350 355 

cag gga cac gca gag tgt cta cag cac etc act tct ttg atg gga gaa 1218 
Gin Gly His Ala Glu Cys Leu Gin His Leu Thr Ser Leu Met Gly Glu 
360 365 370 375 

gac tgc etc aat gag cgc aac act gag aag ttg act cca gca ggc ctg 1266 
Asp Cys Leu Asn Glu Arg Asn Thr Glu Lys Leu Thr Pro Ala Gly Leu 
3 80 385 390 

gee att aag aat ggt cag ttg gag tgc gta cgc tgg atg gtg age gaa 1314 
Ala lie Lys Asn Gly Gin Leu Glu Cys Val Arg Trp Met Val Ser Glu 
395 400 405 

aca gaa gec att gca gaa ctg agt tgt tct aag gat ttt cca age ctt 1362 
Thr Glu Ala He Ala Glu Leu Ser Cys Ser Lys Asp Phe Pro Ser Leu 
410 415 420 
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att cat tac gca ggt tgc tat ggc cag gaa aag att ctt ctg tgg ctt 1410 
He His Tyr Ala Gly Cys Tyr Gly Gin Glu Lys He Leu Leu Trp Leu 
425 430 435 

ctt cag ttt atg caa gaa cag ggc ate .teg ttg gat gaa gta gac cag 1458 
Leu Gin Phe Met Gin Glu Gin Gly He Ser Leu Asp Glu Val Asp Gin 
445 450 455 

gat ggc aac agt gec gtt cac gta gec tea cag cat ggc tac ctt gga 1506 
Asp Gly Asn Ser Ala Val His Val Ala Ser Gin His Gly Tyr Leu Gly 
460 465 470 

tgc ata cag acc ttg gtt gaa tat gga gca aat gtc acc atg cag aac 1554 
Cys He Gin Thr Leu Val Glu Tyr Gly Ala Asn Val Thr Met Gin Asn 
475 480 485 

cac get ggg gaa aag ccc tec cag age gcq gag egg cag ggg cac acc 1602 
His Ala Gly Glu Lys Pro Ser Gin Ser Ala Glu Arg Gin Gly His Thr 
490 495 500 

ctg tgc tec agg tac ctg gtg gtg gtg gag acc tgc atg teg ctg gec 1650 
Leu Cys Ser Arg Tyr Leu Val Val Val Glu Thr Cys Met Ser Leu Ala 
505 510 515 

tct caa gtg gtg. aag tta acc aag cag eta aag gaa caa aca gta gaa 1698 
Ser Gin Val Val Lys Leu Thr Lys Gin Leu Lys Glu Gin Thr Val Glu 
520 525 530 535 

cgt gtc acg ctg cag aac caa etc caa caa ttt eta gaa gec cag aaa 1746 
Arg Val Thr Leu Gin Asn Gin Leu Gin Gin Phe Leu Glu Ala Gin Lys 
540 545 550 

tea gag ggc aag tea etc cct tct tea ccc agt tea cca tec tea cct 1794 
Ser Glu Gly Lys Ser Leu Pro Ser Ser Pro Ser Ser Pro Ser Ser Pro 
555 560 565 

gec tec aga aag tec cag tgg aaa tct cca gat gca gat gat gat tct 
Ala Ser Arg Lys Ser Gin Trp Lys Ser Pro Asp Ala Asp Asp Asp Ser 
570 575 580 

gta gec aaa age aag cca gga gtc caa gag ggg att .cag gtt ctt gga 
Val Ala Lys Ser Lys Pro Gly Val Gin Glu Gly He Gin Val Leu Gly 
585 590 595 

age ctg tea gec tec age egg get aga ccc aaa gca aaa gat gaa gat 
Ser Leu Ser Ala Ser Ser Arg Ala Arg Pro Lys Ala Lys Asp Glu Asp 
600 605 610 615 

tct gat aaa ate tta cgc cag tta ttg gga aag gaa ate tea gaa aat 
Ser Asp Lys He Leu Arg Gin Leu Leu Gly Lys Glu He Ser Glu Asn 
620 625 630 

gtc tgc acc cag gaa aaa ctg tec ttg gaa ttc cag gat get cag get 
Val Cys Thr Gin Glu Lys Leu Ser Leu Glu Phe Gin Asp Ala Gin Ala 
635 640 645 

tec tct aga aat tct aaa aag ate cca ctg gag aag agg gaa ctg aag 
Ser Ser Arg Asn Ser Lys Lys He Pro Leu Glu Lys Arg Glu Leu Lys 
650 655 660 

tta gec agg ctg aga cag ctg atg cag agg tea ctg agt gag tct gac 2130 
Leu Ala Arg Leu Arg Gin Leu Met Gin Arg Ser Leu Ser Glu Ser Asp 
665 670 675 

aca gac tec aac aac tct gag gac ccc aag act acc cca gtg agg aag 2178 
Thr Asp Ser Asn Asn Ser Glu Asp Pro Lys Thr Thr Pro Val Arg Lys 
680 685 690 695 
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get gac cga cca agg ccg cag ccc att gta gaa age gta gag agt atg 
Ala Asp Arg Pro Arg Pro Gin Pro He Val Glu Ser Val Glu Ser Met 
700 705 710 

gac age gca gaa age ctg cac ctg atg att aag aaa cac ace ttg gca 
Asp Ser Ala Glu Ser Leu His Leu Met' He Lys Lys His Thr Leu Ala 
715 720 725 

tea ggg gga cgc agg ttt cct ttc age -ate aag gee tec aaa tec ctg 
Ser Gly Gly Arg Arg Phe Pro Phe Ser He Lys Ala Ser Lys Ser Leu 
730 735 740 

gat ggc cac age cca tct ecc ace tea gag age age gaa cca gac tta 
Asp Gly His Ser Pro Ser Pro Thr Ser Glu Ser Ser Glu Pro Asp Leu 
745 750 755 

gaa tec cag tat cca ggc tea ggg agt att cct cca aac cag ccc tct 
Glu Ser Gin Tyr Pro Gly Ser Gly Ser He Pro Pro Asn Gin Pro Ser 
760 765 770 775 

ggt gac cct cag cag ccc age cct gac agt act get gee cag aaa gtt 
Gly Asp Pro Gin Gin Pro Ser Pro Asp Ser Thr Ala Ala Gin Lys Val 
780 785 790 

gec aca agt ccc aag agt gee etc aag tct cca tct tec aag cgt agg 
Ala Thr Ser Pro Lys Ser Ala Leu Lys Ser Pro Ser Ser Lys Arg Arg 
795 800 805 

aca tct cag aac tta aaa ctg aga gtt ace ttt gag gag cct gtg gtg 
Thr Ser Gin Asn Leu Lys Leu Arg Val Thr Phe Glu Glu Pro Val Val 
810 815 820 

cag atg gag cag cct age ctt gaa ctg aat gga gaa aaa gac aaa gat 
Gin Met Glu Gin Pro Ser Leu Glu Leu Asn Gly Glu Lys Asp Lys Asp 
825 830 "835 



aag ggc agg act etc cag egg ace tec aca agt 
Lys Gly Arg Thr Leu Gin Arg Thr Ser Thr Ser 
840 845 850 

caa ctg aaa agg cct ttt gga gee ttt cga tct 
Gin Leu Lys Arg Pro Phe Gly Ala Phe Arg Ser 
860 . 865 

agt ggc aac caa aac aat aat aat aac tac cag 
Ser Gly Asn Gin Asn Asn Asn Asn Asn Tyr Gin 
875 880 

aaa ace tct aca ttg ccc ttg acc tea ctt ggg 
Lys Thr Ser Thr Leu Pro Leu Thr Ser Leu Gly 
890 895 

aag gga aac cct gee age tec get age aaa gga 
Lys Gly Asn Pro Ala Ser Ser Ala Ser Lys Gly 
905 910 

taatgaeatc aatagaaaaa tgaagaaatc ctacagcata 

gtcaaaagaa ctcttcttgt aaatcacttt ttaaattttc 

aattattgga aatttctgga ctatcctctt tggaaagaga 

ccagcagaag aacagaatat caggatgect taaatttata 

cattttgggg tgatatctgt atatataact tgttttttta 

tgattgggaa aatgtacgtt ttttaagagt agattgattc 



aac gaa teg ggg gat 
Asn Glu Ser Gly Asp 
855 

ate atg gag aca eta 
He Met Glu Thr Leu 
870 

gca gec aac cag ctg 
Ala Ala Asn Gin Leu 
885 

agg aag aca gat gee 
Arg Lys Thr Asp Ala 
900 

aag aat aag gca gca 
Lys Asn Lys Ala Ala 
915 

aagcacattg ctgagecaga 
tctcactgat gccctttgga 
accatgaaaa caatgcctca 
gtagtagact gtaaaagatt 
aaagatgccg tttaaaagca 
accctaccca caggacattc 



2226 

2274 

2322 

2370 

2418 

2466 

2514 

2562 

2610 

2658 

2706 

2754 

2802 

2850 

2910 
2970 
3030 
3090 
3150 
3210 
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accaagccac tgataccatt ttatatttca tcaattgcat gagtatttgc taatgttgat 3270 

tgaacctccc tttccccata atgtgggcag atttggctca gctccttcat gagatcaggt 3330 

cagtggtatt gtttctgtca agagtgtttt ttctgtcatt tctacttttt gtataaagga 3390 

aataaaacaa tgttaacagc caaaaaaaaa aaaaaaaaaa aa 3432 

<210> 2 

<2U> 919 

<212> PRT 

<213> NMJ)05460 SNCAIP, synphilin 

<400> 2 

Met Glu Ala Pro Glu Tyr Leu Asp Leu Asp Glu He Asp Phe Ser Asp 
1 5 10 15 

Asp He Ser Tyr Ser Val Thr Ser Leu Lys Thr He Pro Glu Leu Cys 
20 25 30 

Arg Arg Cys Asp Thr Gin Asn Glu Asp Arg Ser Ala Ser Ser Ser Ser 
35 40 45 

Trp Asn Cys Gly He Ser Thr Leu He Thr Asn Thr Gin Lys Pro Thr 
50 55 60 

Gly He Ala Asp Val Tyr Ser Lys Phe Arg Pro Val Lys Arg Val Ser 
65 70 75 80 

Pro Leu Lys His Gin Pro Glu Thr Leu Glu Asn- Asn Glu Ser Asp Asp 
85 90 95 

Gin Lys Asn Gin Lys Val Val Glu Tyr Gin Lys Gly Gly Glu Ser Asp 
100 105 . no 

Leu Gly Pro Gin Pro Gin Glu Leu Gly Pro Gly Asp Gly Val Gly Gly 
115 120 125 

Pro Pro Gly Lys Ser Ser Glu Pro Ser Thr Ser Leu Gly Glu Leu Glu 
130 135 140 

His Tyr Asp Leu Asp Met Asp Glu He Leu Asp Val Pro Tyr He Lys 
145 150 155 . 160 

Ser Ser Gin Gin Leu Ala Ser Phe Thr Lys Val Thr Ser Glu Lys Arg 
165 170 175 



He Leu Gly Leu Cys Thr Thr He Asn Gly Leu Ser Gly Lys Ala Cys 
180 185 190 
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Ser Thr Gly Ser Ser Glu Ser Ser Ser Ser Asn Met Ala Pro Phe Cys 
195 200 205 



Val Leu Ser Pro Val Lys Ser Pro His Leu Arg Lys Ala Ser Ala Val 
210 215 220 



lie His Asp Gin His Lys Leu Ser Thr Glu Glu Thr Glu lie Ser Pro 
225 230 235 240 



Pro Leu Val Lys Cys Gly Ser Ala Tyr Glu Pro Glu Asn Gin Ser Lys 
245 250 255 



Asp Phe Leu Asn Lys Thr Phe Ser Asp Pro His Gly Arg Lys Val Glu 
260 265 270 



Lys Thr Thr Pro Asp Cys Gin Leu Arg Ala Phe His Leu Gin Ser Ser 
275 280 285 



Ala Ala Glu Ser Lys Pro Glu Glu Gin Val Ser Gly Leu Asn Arg Thr 
290 295 300 



Ser Ser Gin Gly Pro Glu Glu Arg Ser Glu Tyr Leu Lys Lys Val Lys 
305 310 .315 320 



Ser He Leu Asn He Val Lys Glu Gly Gin He Ser Leu Leu Pro His 
325 330 335 



Leu Ala Ala Asp Asn Leu Asp Lys He His Asp Glu Asn Gly Asn Asn 
340 345 350 



Leu Leu His He Ala Ala Ser Gin Gly His Ala Glu Cys Leu Gin His 
355 360 365 



Leu Thr Ser Leu Met Gly Glu Asp Cys Leu Asn Glu Arg Asn Thr Glu 
370 375 380 



Lys Leu Thr Pro Ala Gly Leu Ala He Lys Asn Gly Gin Leu Glu Cys 
385 390 395 400 



Val Arg Trp Met Val Ser Glu Thr Glu Ala He Ala Glu Leu Ser Cys 
405 410 ' 415 



Ser Lys Asp Phe Pro Ser Leu He His Tyr Ala Gly Cys Tyr Gly Gin 
420 425 430 



Glu Lys He Leu Leu Trp Leu Leu Gin Phe Met Gin Glu Gin Gly He 
435 440 445 



Ser Leu Asp Glu Val Asp Gin Asp Gly Asn Ser Ala Val His Val Ala 
450 455 460 
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Ser Gin His Gly Tyr Leu Gly Cys He : Gin Thr Leu Val Glu Tyr Gly 
465 475 . 480 

Ala Asn Val Thr Met Gin Asn His Ala Gly Glu Lys Pro Ser Gin Ser 
485 49Q 495 

Ala Glu Arg Gin Gly His Thr Leu Cys Ser Arg Tyr Leu Val Val Val 
500 505 sio 

Glu Thr Cys Met Ser Leu Ala Ser Gin Val Val Lys Leu Thr Lys Gin 
515 520 525 

.Leu Lys Glu Gin Thr Val Glu Arg Val Thr Leu Gin Asn Gin Leu Gin 
530 535 540 

Gin Phe Leu Glu Ala Gin Lys Ser Glu Gly Lys Ser Leu Pro Ser Ser 
545 550 555 560 

Pro Ser Ser Pro Ser Ser Pro Ala Ser Arg Lys Ser Gin Trp Lys Ser 
565 570 575 

Pro Asp Ala Asp Asp Asp Ser Val Ala Lys Ser Lys Pro Gly Val Gin 
580 585 590 

Glu Gly He Gin Val Leu Gly Ser Leu Ser Ala Ser Ser Arg Ala Arg 
595 600 6 05 

Pro Lys Ala Lys Asp Glu Asp Ser Asp Lys He Leu Arg Gin Leu Leu 
610 615 620 

Gly Lys Glu He Ser Glu Asn Val Cys Thr Gin Glu Lys Leu Ser Leu 
625 6 30 635 . 640 

Glu Phe Gin Asp Ala Gin Ala Ser Ser Arg Asn Ser Lys Lys He Pro 
645 650 655 

Leu Glu Lys Arg Glu Leu Lys Leu Ala Arg Leu Arg Gin Leu Met Gin 
660 665 670 

Arg Ser. Leu Ser Glu Ser Asp Thr Asp Ser Asn Asn Ser Glu Asp Pro 
675 . 680 685 

Lys Thr Thr- Pro Val Arg Lys Ala Asp Arg Pro Arg Pro Gin Pro He 
690 695 700 

Val Glu Ser Val Glu Ser Met Asp Ser Ala Glu Ser Leu His Leu Met 
705 7 10 715 720 



He Lys Lys His Thr Leu Ala Ser Gly Gly Arg Arg Phe Pro Phe Ser 
725 730 735 
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He Lys Ala Ser Lys Ser Leu Asp Gly His Ser Pro Ser Pro Thr Ser 
740 745 750 

Glu Ser Ser Glu Pro Asp Leu Glu Ser Gin Tyr Pro Gly Ser Gly Ser 
755 760 765 

He Pro Pro Asn Gin Pro Ser Gly Asp Pro Gin Gin Pro Ser Pro Asp 
770 775 780 " 

Ser Thr Ala Ala Gin Lys Val Ala Thr Ser Pro Lys Ser Ala Leu Lys 
785 790 795 " 800 

Ser Pro Ser Ser Lys Arg Arg Thr Ser Gin Asn Leu Lys Leu Arg Val 
805 810 " 815 

Thr Phe Glu Glu Pro Val Val Gin Met Glu Gin Pro Ser Leu Glu Leu 
820 825 830 

Asn Gly Glu Lys Asp Lys Asp Lys Gly Arg Thr Leu Gin Arg Thr Ser 
835 840 845 

Thr Ser Asn Glu Ser Gly Asp Gin Leu Lys Arg Pro Phe Gly Ala Phe 
850 855 860 

Arg Ser He Met Glu Thr Leu Ser Gly Asn Gin Asn Asn Asn Asn Asn 
865 870 875 880 

Tyr Gin Ala Ala Asn Gin Leu Lys Thr Ser Thr Leu Pro Leu Thr Ser 
885 890 895 

Leu Gly Arg Lys Thr Asp Ala Lys Gly Asn Pro Ala Ser Ser Ala Ser 
900 905 910 

Lys Gly Lys Asn Lys Ala Ala 
915 

<210> 3 
<211> 4181 
<212> DNA 

<213> NM_002387 MCC, mutated in colorectal cancers 

<220> 

<221> CDS 

<222> (221) . . (2707) 

<223> 
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<400> 3 

cctcctgcag caatggctcg tccgtgaaac gcgagccacg gctgctcttt ttaagagtgc 60 

ctgcatcctc cgtttgcgct tcgcaactgt cctgggtgaa aatggctgtc tagactaaaa 120 

tgtggcagaa gggaccaagc agtggatatt gagcctgtga agtccaacto ttaagctccg 18 0 

agacctgggg gactgagagc ccagctctga aaagtgcatc atg aat tec gga gtt 235 

Met Asn Ser Gly Val 



1 5 



lit Mo? f 34 ! at 9 ? a aa ° 93C tCC tcg gcc 9*9 ct 9 a gt Sag etc cat 
Ala Met Lys Tyr Gly Asn Asp Ser Ser Ala. Glu leu Ser Glu Leu His 



10 15 



20 



til a? 3 g ?° f t9 9Ca tCa ° ta aag gga gat ata ? aa ctt aat aaa 
Ser Ala Ala Leu Ala Ser Leu Lys Gly Asp He Val Glu Leu Asn Lyt 
« 30 



35 



100 



283 



331 



a™ T Ct ° n? g S? a aCa gag agg gaa cg * 9ac ctt ctg gaa aag aaa ttg 379 
Arg Leu Gin Gin Thr Glu Arg Glu Arg Asp Leu Leu Glu Lys Lys Leu 
^ u 45 50 

gee aag gca cag tgc gag cag tec cac etc atg aga gag cat aaa oat &->n 
Ala Lys Ala Gin Cys Glu Gin Ser His Leu Mer Arg 111 His Glu Isp 27 



£l £X r 3 ,! a 93 ^ 9 ^ 9 Ctt C9C t3t gag gaa cgc atc a <= a sag etc 475 
Val Gin Glu Arg Thr Thr Leu Arg Tyr Glu Glu Arg Il e Thr 111 leu 

75 8P 85 

His Ser ?? 9 2? 9 ° tC 330 339 aag ata gac c gt c tg caa gge 523 

H 1S Ser Val lie Ala Glu Leu Asn Lys Lys He Asp Arg Leu Gin Gly 
90 95 



^° ^ ° a ^° 399 gag gaa gat gag tac tc a gaa ctg cga tea aaa etc in 
Thr Thr He Arg Glu Glu Asp Glu Tyr Ser Glu Leu Arg Ser Glu Leu 
105 HO i 15 

s a e s si s s e s 5 e is s a ^ s; « 

125 130 

nf 9 e Ct 9tC tCt atc CCC gaa aac ^ tct acc atg gtt act 
Asp Gin Thr Ser Val Ser He Pre Glu Asn Gin Ser Thr Me? Val S: 
iJ3 140 145 

??! f aC S*? 930 aa ° tgC agt gao ctg aao tca gaa ctg cag agg gtg 
Ala Asp Met Asp Asn Cys Ser Asp Leu Asn Ser Gin Leu Gin Arg Val 
155 160 165 

ctg aca ggg ctg gag aat gtt gtc tgc ggc agg aaq aaa aac aor rm- 
Leu Thr Gly Leu Glu Asn Val Val Cys lly £ J* ^ g 

170 175 lao 

age etc tec gtg gec gag gtg gac agg cac att gag cag etc acc aca fii i 

Ser Leu Ser Val Ala Glu Val Asp Arg His lie Glu Gin 2u S£ Thr 8U 
185 190 195 

gcc age gag cac tgt gac ctg get att aag aca gtc gag gag att qaa 859 
Ala Ser Glu His Cys Asp Leu Ala lie Lys Thr Val Glu Glu lie 111 
zo ° 205 210 

ggg gtg ctt ggc egg gac ctg tat ccc aac ctg get gaa gag aaa tct 907 
Gly val Leu Gly Arg Asp Leu Tyr Pre Asn Leu La Glu Glu *Jg s2 
215 220 225 



667 



715 



763 
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egg tgg gag aag gag ctg get ggg ctg agg gaa gag aat gag age ctg 955 
Arg Trp Glu Lys Glu Leu Ala Gly Leu Arg Glu Glu Asn Glu Ser Leu 
230 235 240 245 



act gee atg ctg tgc age aaa gag gaa gaa ctg aac egg act aag gec 
Thr Ala Met Leu Cys Ser Lys Glu Glu Glu Leu Asn Arg Thr Lys Ala 
250 -255 260 



1003 



acc atg aat gec ate egg gaa gag egg gac egg etc egg agg egg gtc 1051 
Thr Met Asn Ala He Arg Glu Glu Arg Asp Arg Leu Arg Arg Arg Val 
265 270 275 

aga gag ctt caa act cga eta cag age gtg cag gee aca ggt ccc tec 1099 
Arg Glu Leu Gin Thr Arg Leu Gin Ser Val Gin Ala Thr Gly Pro Ser 
280 285 290 

age cct .ggc cgc etc act tec acc aac cgc ccg att aac ccc age act 1147 
Ser Pro Gly Arg Leu Thr Ser Thr Asn Arg Pro He Asn Pro Ser Thr 
295 300 305 

ggg gag ctg age aca age age age age aat gac att ccc ate gec aag 1195 
Gly Glu Leu Ser Thr Ser Ser Ser Ser Asn Asp lie Pro He Ala Lys 
310 315 320 325 

att get gag agg gtg aag eta tea aag aca agg tec gaa teg tea tea 1243 
He Ala Glu Arg Val Lys Leu Ser Lys Thr Arg Ser Glu Ser Ser Ser 
330 335 340 

tct gat egg cca gtc ctg ggc tea gaa ate agt age ata ggg gta tec 1291 
Ser Asp Arg Pro Val Leu Gly Ser Glu He Ser Ser He Gly Val Ser 
345 m 350 355 

age agt gtg get gaa cac ctg gee cac tea ctt cag gac tgc tec aat 1339 
Ser Ser Val Ala Glu His Leu Ala His Ser Leu Gin Asp Cys Ser Asn 
360 365 370 

ate caa gag att ttc caa aca etc tac tea cac gga tct gee ate tea 1387 
He Gin Glu He Phe Gin Thr Leu Tyr Ser His Gly Ser Ala He Ser 
375 380 385 

gaa age aag att aga gag ttt gag gtg gaa aca gaa egg ctg aat age 1435 
Glu Ser Lys He Arg Glu Phe Glu Val Glu Thr Glu Arg Leu Asn Ser 
390 395 400 405 

egg att gag cac etc aaa tec caa aat gac etc ctg acc ata acc ttg 1483 
Arg He Glu His Leu Lys Ser Gin Asn Asp Leu Leu Thr He Thr Leu 
410 415 420 

gag gaa tgt aaa age aat get gag agg atg age atg ctg gtg gga aaa 1531 
Glu Glu Cys Lys Ser Asn Ala Glu Arg Met Ser Met Leu Val Gly Lys 
425 430 435 

tac gaa tec aat gec aca gcg ctg agg ctg gee ttg cag tac age gag 1579 
Tyr Glu Ser Asn Ala Thr Ala Leu Arg Leu Ala Leu Gin Tyr Ser Glu 
440 445 450 

cag tgc ate gaa gec tac gaa etc etc ctg gcg ctg gca gag agt gag 1627 
Gin Cys He Glu Ala Tyr Glu Leu Leu Leu Ala Leu Ala Glu Ser Glu 
455 460 465 

cag age etc ate ctg ggg cag ttc cga gcg gcg ggc gtg ggg tec tec 1675 
Gin Ser Leu He Leu Gly Gin Phe Arg Ala Ala Gly Val Gly Ser Ser 
470 475 480 485 

cct gga gac cag teg ggg gat gaa aac ate act cag atg etc aag cga 1723 
Pro Gly Asp Gin Ser Gly Asp Glu Asn He Thr Gin Met Leu Lys Arg 
490 495 500 
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get cat gac tgc egg aag aca' get gag aac get gee aag gee ctg etc 1771 
Ala His Asp Cys Arg Lys Thr Ala Glu Asn Ala Ala Lys Ala Leu Leu 
505 510 515 

atg aag ctg gac ggc age tgt ggg gga gee ttt gee gtg gee ggc tgc 1819 
Met Lys Leu Asp Gly Ser Cys Gly Gly Ala Phe Ala Val Ala Gly Cys 
520 525 530 

age gtg cag ccc tgg gag age ctt tec. tec aac age cac acc age aca .1867 
Ser Val Gin Pro Trp Glu Ser Leu Ser .Ser Asn Ser His Thr Ser Thr 
535 540 545 

acc age tec aca gee agt agt tgc gac acc gag ttc act aaa gaa gac 1915 
Thr Ser Ser Thr Ala Ser Ser Cys Asp Thr Glu Phe Thr Lys Glu Asp 
550 555 560 565 

gag cag agg ctg aag gat tat ate cag cag etc aag aat gac agg get 1963 
Glu Gin Arg Leu Lys Asp Tyr He Gin Gin Leu Lys Asn Asp Arg Ala 
570 575 580 

gcg gtc aag ctg acc atg ctg gag ctg gaa age ate cac ate gat cct 2011 
Ala Val Lys Leu Thr Met Leu Glu Leu Glu Ser He His He Asp Pro 
585 590 595 

etc age tat gac gtc aag cct egg gga gac age cag agg ctg gat ctg 2059 
Leu Ser Tyr Asp Val Lys Pro Arg Gly Asp Ser Gin Arg Leu Asp Leu 
600 605 610 

gaa aac gca gtg ctt atg cag gag etc atg gec atg aag gag gag atg 2107 
Glu Asn Ala Val Leu Met Gin Glu Leu Met Ala Met Lys Glu Glu Met 
615 620 • 625 

gee gag ttg aag gec cag etc tac eta ctg gag aaa gag aag aag gec 2155 
Ala Glu Leu Lys Ala Gin Leu Tyr Leu Leu Glu Lys Glu Lys Lys Ala 
«0 635 640 645 

ctg gag ctg aag ctg age acg egg gag gec cag gag cag gee tac ctg 2203 
Leu Glu Leu Lys Leu Ser Thr Arg Glu Ala Gin Glu Gin Ala Tyr Leu 
650 655 660 

gtg cac att gag cac ctg aag tec gag gtg gag gag cag aag gag cag 2251 
Val His He Glu His Leu Lys Ser Glu Val Glu Glu Gin Lys Glu Gin 
665 670 675 

egg atg cga tec etc age tec acc age age ggc age aaa gat aaa cct 2299 
Arg Met Arg Ser Leu Ser Ser Thr Ser Ser Gly Ser Lys Asp Lys Pro 
680 685 . 690 

ggc aag gag tgt get gat get gec tec cca get ctg tec eta get gaa 2347 
Gly Lys Glu Cys Ala Asp Ala Ala Ser Pro Ala Leu Ser Leu Ala Glu 
695 700 705 

etc agg aca acg tgc age gag aat gag ctg get gcg gag ttc acc aac 2395 
Leu Arg Thr Thr Cys Ser Glu Asn Glu Leu Ala Ala Glu Phe Thr Asn 
7 *0 715 720 725 

gee att cgt cga gaa aag aag ttg aag gee aga gtt caa gag ctg gtg 2443 
Ala He Arg Arg Glu Lys Lys Leu Lys Ala Arg Val Gin Glu Leu Val 
730 735 740 

agt gec ttg gag aga etc acc aag age agt gaa ate cga cat cag caa 2491 
Ser Ala Leu Glu Arg Leu Thr Lys Ser Ser Glu He Arg His Gin Gin 
745 750 jS5 

tct gca gag ttc gtg aat gat eta aag egg gec aac age aac ctg gtg 2539 
Ser Ala Glu Phe Val Asn Asp Leu Lys Arg Ala Asn Ser Asn Leu Val 
760 765 770 
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get gec tat gag aaa gca aag aaa aag cat caa aac aaa ctg aag aag 2587 
Ala Ala Tyr Glu Lys Ala Lys Lys Lys .His Gin Asn Lys Leu Lys Lys 
775 780 785 

tta gag teg cag atg atg gee atg gtg gag aga cat gag acc caa gtg 2635 
Leu Glu Ser Gin Met Met Ala Met Val Glu Arg His Glu Thr Gin Val 
790 795 800 805 

agg atg etc aag caa aga ata get ctg eta gag gag gag aac tec agg 2683 
Arg Met Leu Lys Gin Arg lie Ala Leu Leu Glu Glu Glu Asn Ser Arg 
810 815 . 820 

cca cac acc aat gaa act teg ctt taatcagcac tcacgcaccg gagttctgee 2737 
Pro His Thr Asn Glu Thr Ser Leu 
825 

catgggaagt aaactgeage aggecactgg ggacagaagg gcccatgtac ttgttgggag 2797 

gaggaggaaa gggaaggctg gcaggtaggt eggcacttgg acaatggagt gccccaactc 2857 

aacccttggg gtgactggcc atggtgacat tgtggactgt atccagaggt gcccgctctt 2917 

ccctcctggg cccacaacag cgtgtaaaca catgttctgt gcctgctcag cagagcctcg 2977 

tttctgcttt cagcactcac tctccccctc ctcttctggt ctggcggctg tgcatcagtg 3037 

ggatcccaga catttgtttc tgtaagattt tccattgtat cctctttttg gtagatgctg 3097 

ggctcatctt ctagaatctc gtttctcctc tttcctcctg cttcatggga aaacagacct 3157 

gtgtgtgcct ccagcattta aaaggactgc tgatttgttt actacagcaa ggctttggtt 3217 

tccaagtccc gggtctcaac tttaagatag aggeggecat aagaggtgat ctctgggagt 3277 

tataggtcat gggaagagcg tagacaggtg ttacttacag tcccagatac actaaagtta 3337 

caaacagacc accaccagga ctgtgcctga acaattttgt attgagagaa taaaaacttc 3397 

cttcaatctt cattttggag gcagggctgg gaagggagcg ctctcttgat tctgggattt 3457 

ctccctctca gtggagcctt attaatatcc aagacttaga gctgggaatc tttttgatac 3517 

ctgtagtgga actaaaattc tgtcaggggt ttcttcaaga gctgagaaac attattagca 3577 

cttcccgccc cagggcacta cataattget gttctgctga atcaaatctc ttccacatgg 3637 

gtgcatttgt agctctggac ctgtctctac ctaaggacaa gacactgagg agatactgaa 3697 

cattttgeaa aacttatcac gectacttaa gagtgctgtg taacccccag ttcaagactt 3757 

agctcctgtt gtcatgaegg ggacagagtg agggaatggt agttaaggct tettttttge 3817 

ccccagatac atggtgatgg ttagcatatg gtgcttaaaa ggttaaattt caagcaaaat 3877 

gcttacaggg ctaggcagta ccaaagtaac tgaattattt caggaaggtc ttcaatctta 3937 

aaacaaattc attattcttt ttcagtttta. cctcttctct ctcagttcta cactgataca 3997 

cttgaaggac catttactgt ttttttctgt agcaccagag aatccatcca aagttcccta 4057 

tgaaaaatgt gttccattgc catagctgac tacaaattaa agttgaggag gtttctgeat 4117 

agagtcttta tgtccataag etaegggtag gtctattttc agagcatgat acaaattcca 4177 

cagg 418 i 
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<210> 4 
;<211> 829 

<212> PRT ' 

<213> NM_002387 MCC, mutated in colorectal cancers 
<400> 4 

Met Asn Ser Gly Val Ala Met Lys Tyr Gly Asn Asp Ser Ser Ala Glu 
1 5 .10 15 

Leu Ser Glu Leu His Ser Ala Ala Leu Ala Ser Leu Lys Gly Asp He 
20 25 30 

Val Glu Leu Asn Lys Arg Leu Gin Gin Thr Glu Arg Glu Arg Asp Leu 
35 40 45 

Leu Glu Lys Lys Leu Ala Lys Ala Gin Cys Glu Gin Ser His Leu Met 
50 55 60 

Arg Glu His Glu Asp Val Gin Glu Arg Thr Thr Leu Arg Tyr Glu Glu 
65 70 75 80 

Arg He Thr Glu Leu His Ser Val He Ala Glu Leu Asn Lys Lys He 
85 90 95 

Asp Arg Leu Gin Gly Thr Thr lie Arg Glu Glu Asp Glu Tyr Ser Glu 
100 105 no 

Leu Arg Ser Glu Leu Ser Gin Ser Gin His Glu Val Asn Glu Asp Ser 
115 120 125 

Arg Ser Met Asp Gin Asp Gin Thr Ser Val Ser He Pro Glu Asn Gin 
130 135 140 

Ser Thr Met Val Thr Ala Asp Met Asp Asn Cys Ser Asp Leu Asn Ser 
145 150 155 160 

Glu Leu Gin Arg Val Leu Thr Gly Leu Glu Asn Val Val Cys Gly Arg 
165 170 175 

Lys Lys Ser Ser Cys Ser Leu Ser Val Ala Glu Val Asp Arg His He 
180 185 190 

Glu Gin Leu Thr Thr Ala Ser Glu His Cys Asp Leu Ala He Lys Thr 
195 200 205 



Val Glu Glu lie Glu Gly Val Leu Gly Arg Asp Leu Tyr Pro Asn Leu 
210 215 220 
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Ala Glu Glu Arg Ser Arg Trp Glu Lys Glu Leu Ala Gly Leu Arg Glu 
225 230 235 " "* 240 



Glu Asn Glu Ser Leu Thr Ala Met Leu Cys Ser Lys Glu Glu Glu Leu 
245 250 255 



Asn Arg Thr Lys Ala Thr Met Asn Ala lie Arg Glu Glu Arg Asp Arg 
260 265 270 



Leu Arg Arg Arg Val Arg Glu Leu Gin Thr Arg Leu Gin Ser Val Gin 
275 280 285 



Ala Thr Gly Pro Ser Ser Pro Gly Arg Leu Thr Ser Thr Asn Arg Pro 
290 295 300 



He Asn Pro Ser Thr Gly Glu Leu Ser Thr Ser Ser Ser Ser Asn Asp 
305 310 315 320 



He Pro He Ala Lys He Ala Glu Arg Val Lys Leu Ser Lys Thr Arg 
325 330 335 



Ser Gin Ser Ser Ser Ser Asp Arg Pro Val Leu Gly Ser Glu He Ser 
340 345 350 



Ser He Gly Val Ser Ser Ser Val Ala Glu His Leu Ala His Ser Leu 
355 360 365 



Gin Asp Cys Ser Asn He Gin Glu He Phe Gin Thr Leu Tyr Ser His 
370 375 380 



Gly Ser Ala He Ser Glu Ser Lys He Arg Glu. Phe Glu Val Glu Thr 
385 390 395 400 



Glu Arg Leu Asn Ser Arg He Glu His Leu Lys Ser Gin Asn Asp Leu 
405 410 415 



Leu Thr He Thr Leu Glu Glu Cys Lys Ser Asn Ala Glu Arg Met Ser 
420 " 425 430 



Met Leu Val Gly Lys Tyr Glu Ser Asn Ala Thr Ala Leu Arg Leu Ala 
435 440 445 



Leu Gin Tyr Ser Glu Gin Cys He Glu Ala Tyr Glu Leu Leu Leu Ala 
450 455 460 



Leu Ala Glu Ser Glu Gin Ser Leu He Leu Gly Gin Phe Arg Ala Ala 
465 470 475 480 



Gly Val Gly Ser Ser Pro Gly Asp Gin Ser Gly Asp Glu Asn He Thr 
485 490 • 495 



WO 2004/022778 



15 



PCT/AU2003/001166 



Gin Met Leu Lys Arg Ala His Asp Cys Arg Lys Thr Ala Glu Asn Ala 
500 505 510 

Ala Lys Ala Leu Leu Met Lys Leu Asp Gly Ser Cys Gly Gly Ala Phe 
515 520 525 

Ala Val Ala Gly Cys Ser Val Gin Pro Trp Glu Ser Leu Ser Ser Asn 
530 535 -540 

Ser His Thr Ser Thr Thr Ser Ser Thr Ala Ser Ser Cys Asp Thr Glu 
545 550 555 560 

Phe Thr Lys Glu Asp Glu Gin Arg Leu Lys Asp Tyr lie Gin Gin Leu 
565 570 575 

Lys Asn Asp Arg Ala Al'a Val Lys Leu Thr Met Leu Glu Leu Glu Ser 
580 585 590 

He His lie. Asp Pro Leu Ser Tyr Asp Val Lys Pro Arg Gly Asp Ser 
595 600 605 

Gin Arg Leu Asp Leu Glu Asn Ala Val Leu Met Gin Glu Leu Met Ala 
€10 615 620 



Met Lys Glu Glu Met Ala Glu Leu Lys Ala Gin Leu Tyr Leu Leu Glu 
625 630 635 640 

Lys Glu Lys Lys Ala Leu Glu Leu Lys Leu Ser Thr Arg Glu Ala Gin 
645 650 ~ 655 

Glu Gin Ala Tyr Leu Val His He Glu His Leu Lys Ser Glu Val Glu 
660 665 670 

Glu Gin Lys Glu Gin Arg Met Arg Ser Leu Ser . Ser, Thr Ser Ser Gly 
675 680 685 

Ser Lys Asp Lys Pro Gly Lys Glu Cys Ala Asp Ala Ala Ser Pro Ala 
690 695 700 

Leu Ser Leu Ala Glu Leu Arg Thr Thr Cys Ser Glu Asn Glu Leu Ala 
7 °5 710 715 720 

Ala Glu Phe Thr Asn Ala He Arg Arg Glu Lys Lys Leu Lys Ala Arg 
725 730 735 

Val Gin Glu Leu Val Ser Ala Leu Glu Arg Leu Thr Lys Ser Ser Glu 
740 745 750 



He Arg His Gin Gin Ser Ala Glu Phe Val Asn Asp Leu Lys Arg Ala 
755 760 765 
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Asn Ser Asn Leu Val Ala Ala Tyr Glu Lys Ala Lys Lys Lys His Gin " 
770 775 780 

Asn Lys Leu Lys Lys Leu Glu Ser Gin Met Met Ala Met Val Glu Arg 
785 790 795 800 

His Glu Thr Gin Val Arg Met Leu Lys Gin Arg He Ala Leu Leu Glu 
805 810 815 

Glu Glu Asn Ser Arg Pro His Thr Asn Glu Thr Ser Leu 
820 825 

<210> 5 
<211> 2830 
<212> DNA 

<213> NMJJ22117 SE20-4 
<220> 

<221> CDS 

<222> (130) . . (2208) 

<223> 
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<400> 5 

aattcggcac gaggagagct ggttgcgtga gtctcctcag ctctgcttac cggtgcgact 60 

agcggcagcg acgcggctaa aagcgaaggg gcgagtgcga gtcccctgag ctgtacgaac 120 

gcggtcgcc atg gac cgc cca gat gag ggg cct ccg gcc aag acc cgc cgc 171 
Met Asp Arg Pro Asp Glu Gly Pro Pro Ala Lys Thr Arg Arg 
1 5 10 

ctg age age tec gag tct cca cag cgc gac ccg ccc ccg ccg ccg ccg 219 
Leu Ser Ser Ser Glu Ser Pro Gin Arg Asp Pro Pro Pro Pro Pro Pro 
15 20 25 30 

ccg ccg ccg etc etc cga ctg ccg ctg cct cca ccc cag cag cgc ccg 267 
Pro Pro Pro Leu Leu Arg Leu Pro Leu Pro Pro Pro Gin Gin Arg Pro 
35 40 45 

agg etc cag gag gaa acg gag gcg gca cag gtg ctg gcc gat atg agg 315 
Arg Leu Gin Glu Glu Thr Glu Ala Ala Gin Val Leu Ala Asp Met Arg 
50 55 60 

ggg gtg gga ctg ggc ccc gcg ctg ccc ccg ccg cct ccc tat gtc att 363 
Gly Val Gly Leu Gly Pro Ala Leu Pro- Pro Pro Pro Pro Tyr Val He 
65 70 75 

etc gag gag ggg ggg ate cgc gca tac ttc acg etc ggt get gag tgt 411 
Leu Glu Glu Gly Gly He Arg Ala Tyr Phe Thr Leu Gly Ala Glu Cys 
80 85 90 

ccc ggc tgg gat tct acc ate gag teg ggg tat ggg gag gcg ccc ccg 459 
Pro Gly Trp Asp Ser Thr He Glu Ser Gly Tyr Gly Glu Ala Pro Pro 
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95 100 105 110 

ccc acg gag age ctg gaa gca etc ccc act cct gag gec teg ggg ggg 507 
Pro Thr Glu Ser Leu Glu Ala Leu Pro.Thr Pro Glu Ala Ser Gly Gly 
115 120 125 

age ctg gaa ate gat ttt cag gtt gta cag teg age agt ttt ggt gga 555 
Ser Leu Glu lie Asp Phe Gin Val Val Gin Ser Ser Ser Phe Gly Gly 
130 135 140 

gag ggg gee eta gaa ace tgt age gca gtg ggg tgg gcg ccc cag agg " 603 
Glu Gly Ala Leu Glu Thr Cys Ser Ala Val Gly Trp Ala Pro Gin Arg 
145 150 155 

tta gtt gac ccg aag age aag gaa gag gcg ate ate ata gtg gag gat 651 
Leu Val Asp Pro Lys Ser Lys Glu Glu Ala lie lie lie Val Glu Asp 
160 165 170 

gag gat gag gat gag egg gag agt atg agg age age agg agg egg egg 699 
Glu Asp Glu Asp Glu Arg Glu Ser Met Arg Ser Ser Arg Arg Arg Arg 
175 180 185 190 

egg egg egg agg agg aag cag agg aag gtg aag agg gaa age aga gag 747 
Arg Arg Arg Arg Arg Lys Gin Arg Lys Val Lys Arg Glu Ser Arg Glu 
195 200 205 

aga aat gee gag agg atg gag age ate ctg cag gca ctg gag gat att 795 
Arg Asn Ala Glu Arg Met Glu Ser lie Leu Gin Ala Leu Glu Asp lie 
210 215 220 

cag ctg gat ctg gag gca gtg aac ate aag gca ggc aaa gee ttc ctg 843 
Gin Leu Asp Leu Glu Ala Val Asn He .Lys Ala Gly -Lys Ala Phe Leu 
225 230 235 

cgt etc aag cgc aag ttc ate cag atg cga aga ccc ttc ctg gag cgc 891 
Arg Leu Lys Arg Lys Phe He Gin Met Arg. Arg Pro Phe Leu Glu Arg 
240 245 250 

aga gac etc ate ate cag cat ate cca ggc ttc tgg gtc aaa gca ttc 939 
Arg Asp Leu He He Gin His He Pro Gly Phe Trp Val Lys Ala Phe 
255 260 265 270 

etc aac cac ccc aga att tea att ttg ate aac cga cgt gat gaa gac 987 
Leu Asn His Pro Arg He Ser He Leu He Asn Arg Arg Asp Glu Asp 
275 280 285 

att ttc cgc tac ttg ace aat ctg cag gta cag gat etc aga cat ate 1035 
He Phe Arg Tyr Leu Thr Asn Leu Gin Val Gin Asp Leu Arg His He 
290 295 300 

tec atg ggc tac aaa atg aag ctg tac ttc cag act aac ccc tac ttc 1083 
Ser- Met Gly Tyr Lys Met Lys Leu Tyr Phe Gin Thr Asn Pro Tyr Phe 
305 310 315 

aca .aac atg gtg att gtc aag gag ttc cag cgc aac cgc tea ggc egg 1131 
Thr Asn Met Val He Val Lys Glu Phe Gin Arg Asn Arg Ser Gly Arg 
320 325 330 

ctg gtg tct cac tea acc cca ate cgc tgg cac egg ggc cag gaa ccc 1179 
Leu Val Ser His Ser Thr Pro He Arg Trp His Arg Gly Gin Glu Pro 
335 340 345 350 

cag gec cgt cgt cac ggg aac cag gat gcg age cac age ttt ttc age 1227 
Gin Ala Arg Arg His Gly Asn Gin Asp. Ala.. Ser His Ser Phe Phe Ser 
355 360 365 

tgg ttc tea aac cat age etc cca gag get gac agg att get gag att 1275 
Trp Phe Ser Asn His Ser Leu Pro Glu Ala Asp Arg He Ala Glu He 



WO 2004/022778 



18 



PCT/AU2003/001166 



370 375 . 380 

ate aag aat gat ctg tgg gtt aac cct eta cgc tac tac ctg aga gaa 1323 
lie Lys Asn Asp Leu Trp Val Asn Pro Leu Arg Tyr Tyr Leu Arg Glu 
385 390- 395 

agg ggc tec agg ata aag aga aag aag caa gaa atg aag aaa cgt aaa 1371 
Arg Gly Ser Arg lie Lys Arg Lys Lys Gin Glu Met Lys Lys Arg Lys 
400 405 410 

ace agg ggc aga tgt gag gtg gtg ate atg gaa gac gee cct gac tat 1419 
Thr Arg Gly Arg Cys Glu Val Val He Met Glu Asp Ala Pro Asp Tyr 
415 420 425 430 

tat gca gtg gaa gac att ttc age gag ate tea gac att gat gag aca 1467 
Tyr Ala Val Glu Asp He Phe Ser Glu He Ser Asp He Asp Glu Thr 
435 440 445 

att cat gac ate aag ate tct gac ttc atg gag ace ace gac tac ttc 1515 
He His Asp He Lys He Ser Asp Phe Met Glu Thr Thr Asp Tyr Phe 
450 455 460 

gag acc act gac aat gag ata act gac ate aat gag aac ate tgc gac 1563 
Glu Thr Thr Asp Asn Glu He Thr Asp He Asn Glu Asn He Cys Asp 
465 470 475 

age gag aat cct gac cac aat gag gtc ccc aac aac gag acc act gat 1611 
Ser Glu Asn Pro Asp His Asn Glu Val Pro Asn Asn Glu Thr Thr Asp 
480 485 490 

aac aac gag agt get gat gac cac gaa acc act gac aac aat gag agt 1659 
Asn Asn Glu Ser Ala Asp Asp His Glu Thr Thr Asp Asn Asn Glu Ser 
495 500 505 510 

gca gat gac aac aac gag aat cct gaa gac aat aac aag aac act gat 1707 
Ala Asp Asp Asn Asn Glu Asn Pro Glu Asp Asn Asn Lys Asn Thr Asp 
515 520 525 

gac aac gaa gag aac cct aac aac aac gag aac act tac ggc aac aac 1755 
Asp Asn Glu Glu Asn Pro Asn Asn Asn Glu Asn Thr Tyr Gly Asn Asn 
530 535 "* 540 

ttc ttc aaa ggt ggc ttc tgg ggc age cat ggc aac aac cag gac age 1803 
Phe Phe Lys Gly Gly Phe Trp Gly Ser His Gly Asn Asn Gin Asp Ser 
545 550 555 

age gac agt gac aat gaa gca gat gag gee agt gat gat gaa gat aat 1851 
Ser Asp Ser Asp Asn Glu Ala Asp Glu Ala Ser Asp Asp Glu Asp Asn 
560 565 570 

gat ggc aac gaa ggt gac aat gag ggc agt gat gat gat ggc aat gaa 1899 
Asp Gly Asn Glu Gly Asp Asn Glu Gly Ser Asp Asp Asp Gly Asn Glu 
575 580 585 * 590 

ggt gac aat gaa ggc age gat gat gac gac aga gac att gag tac tat 1947 
Gly Asp Asn Glu Gly Ser Asp Asp Asp Asp Arg Asp He Glu Tyr Tyr 
595 600 605 

gag aaa gtt att gaa gac ttt gac aag gat cag get gac tac gag gac 1995 
Glu Lys Val He Glu Asp Phe Asp Lys Asp Gin Ala Asp Tyr Glu Asp 
610 615 620 

gtg ata gag ate ate tea gac gaa tea gtg gaa gaa gag ggc att gag 2043 
Val He Glu He He Ser Asp Glu Ser Val Glu Glu Glu Gly He Glu 
625 630 635 

gaa ggc ate cag caa gat gag gac ate tat gag gaa gga aac tat gag 2091 
Glu Gly He Gin Gin Asp Glu Asp He Tyr Glu Glu Gly Asn Tyr Glu 
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645 



650 



gag gaa gga agt gaa gat gtc tgg gaa gaa ggg gaa gat teg gac gac 2139 
Glu Glu Gly Ser Glu Asp Val Trp Glu Glu Gly Glu Asp Ser Asp Asp 
655 660 • 665 670 

tct gac eta gag gat gtg ctt cag gtc cca aac ggt tgg gee aat ccg 2187 
Ser Asp Leu Glu Asp Val Leu Gin Val Pro Asn Gly Trp Ala Asn Pro 
675 680 685 

ggg aag agg ggg aaa acc gga taagggtttt ccccttttgg ggatcacctc 2238 
Gly Lys Arg Gly Lys Thr Gly 
690 

tctgtatccc ccacccacta tcccatttgc cctcctcctc agetagggee acgcggcccc 2298 

acattgeact tctggggggt gaccgacttc gtacacgggt ttaaagttta tttttatggt 2358 

ttagtcattg cagagttctt attttggggg gagggaaagg gggctagtcc ccttcttttg 2418 

gccctccgcc cccgcaggct tctgtgtgct gctaactgta tttattgtga tgccttggtc 2478 

agggcccctc tacccacttc tcccagtcag ttgtggcccc agcccctctc cctgtgctgt 2538 

gtggagtgga caccctgacc cccgaagcgg ggagggcege tgtggccttc gtcacagccg 2598 

cgcagtgccc atggaggcgc tgctgccacc ttcctctccc. aagttctttc tccatccctc 2658 

tcctcttccc gccgcgccgc tagcccgcct cggtgtctat gcaaggccgc ttcgccattg 2718 

eggtattett tgeggtatte ttgtccccgt cccccagaag gctcgcctct ccccgtggac 2778 

cctgttaatc ccaataaaat tctgagcaag ttcaaaaaaa aaaaaaaaaa aa 2830 

<210> 6 

<211> 693 

<212> PRT 

<213> NM__022117 SE20-4 

<400> 6 

Met Asp Arg Pro Asp Glu Gly Pro Pro Ala Lys Thr Arg Arg Leu Ser 
1 5 io 15 

Ser Ser Glu Ser Pro Gin Arg Asp Pro Pro Pro Pro Pro Pro Pro Pro 
20 25 30 

Pro Leu Leu Arg Leu Pro Leu Pro Pro Pro Gin Gin Arg Pro Arg Leu 
35 40 45 

Gin Glu Glu Thr Glu Ala Ala Gin Val Leu Ala Asp Met Arg Gly Val 
50 55 60 



Gly Leu Gly Pro Ala Leu Pro Pro Pro Pro Pro Tyr. Val lie Leu Glu 
65 70 75 80 



Glu Gly Gly lie Arg Ala Tyr Phe Thr Leu Gly Ala Glu Cys Pro Gly 
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85 90 95 

Trp Asp Ser Thr lie Glu Ser Gly Tyr Gly Glu Ala Pro Pro Pro Thr 
100 105 J 110 

Glu Ser Leu Glu Ala Leu Pro Thr Pro Glu Ala Ser Gly Gly Ser Leu 
115 120 125 

Glu lie Asp Phe Gin Val Val Gin Ser Ser Ser Phe Gly Gly Glu Gly 
130' 135 140 

Ala Leu Glu Thr Cys Ser Ala Val Gly Trp Ala Pro Gin Arg Leu Val 
. 145 150 155 160 

Asp Pro Lys Ser Lys Glu Glu Ala lie He He Val Glu Asp Glu Asp 
165 170 175 

Glu Asp Glu Arg Glu Ser Met Arg Ser Ser Arg Arg Arg Arg Arg Arg 
180 185 190 

Arg Arg Arg Lys Gin Arg Lys Val Lys Arg Glu Ser Arg Glu Arg Asn 
195 200 205 

Ala Glu Arg Met Glu Ser He Leu Gin Ala Leu Glu Asp He Gin Leu 
210 215 220 

Asp Leu Glu Ala Val Asn He Lys Ala Gly Lys Ala Phe Leu Arg Leu 
225' 230 235 240 

Lys Arg Lys Phe He Gin Met Arg Arg Pro Phe Leii Glu Arg Arg Asp 
245 250 255 

Leu He He Gin His lie Pro Gly Phe Trp Val Lys Ala Phe Leu Asn 
260 265 270 

His Pro Arg He Ser He Leu He Asn Arg Arg Asp Glu Asp He Phe 
275 280 285 

Arg Tyr Leu Thr Asn Leu Gin Val Gin Asp Leu Arg His He Ser Met 
290 295 300 

Gly Tyr Lys Met Lys Leu Tyr Phe Gin Thr Asn Pro Tyr Phe Thr Asn 
305 310 315 320 

Met Val He Val Lys Glu Phe Gin Arg Asn Arg Ser Gly Arg Leu Val 
325 330 335 

Ser His Ser Thr Pro He Arg Trp His Arg Gly Gin Glu Pro Gin Ala 
340 345 350 



Arg Arg His Gly Asn Gin Asp Ala Ser His Ser Phe Phe Ser Trp Phe 



,1 

WO 2004/022778 PCT/AU2003/001166 

21 

355 360 365 

Ser Asn His Ser Leu Pro Glu Ala Asp Arg He Ala Glu He He Lys 
370 375 3 8 o 

Asn Asp Leu Trp Val Asn Pro Leu Arg "Tyr Tyr Leu Arg Glu Arg Gly 
385 390 395 400 

Ser Arg He Lys Arg Lys Lys Gin Glu Met Lys Lys Arg Lys Thr Arg 
405 410 " J 415 

Gly Arg Cys Glu Val Val He Met Glu Asp Ala Pro Asp Tyr Tyr Ala 
•420 425 * 430 

Val Glu Asp He Phe Ser Glu lie Ser Asp He Asp Glu Thr He His 
435 440 445 

Asp He Lys He Ser Asp Phe Met Glu Thr Thr Asp Tyr Phe Glu Thr 
450 455 460 

Thr Asp Asn Glu He Thr Asp He Asn Glu Asn He Cys Asp Ser Glu 
465 470 475 480 

Asn Pro Asp His Asn Glu Val Pro Asn Asn Glu Thr Thr Asp Asn Asn 
485 490 495 

Glu Ser Ala Asp Asp His Glu Thr Thr Asp Asn Asn Glu Ser Ala Asp 
500 505 510 

Asp Asn Asn Glu Asn Pro Glu Asp Asn Asn Lys" Asn Thr Asp Asp Asn 
515 520 525 

Glu Glu Asn Pro Asn Asn Asn Glu Asn Thr Tyr Gly Asn Asn Phe Phe 
530 535 540 

Lys Gly Gly Phe Trp Gly Ser His Gly Asn Asn Gin Asp Ser Ser Asp 
545 550 555 560 

Ser Asp Asn Glu Ala Asp Glu Ala Ser Asp Asp Glu Asp Asn Asp Gly 
565 570 575 

Asn Glu Gly Asp Asn Glu Gly Ser Asp Asp Asp Gly Asn Glu Gly Asp 
580 585 590 

Asn Glu Gly Ser Asp Asp Asp Asp Arg Asp He Glu Tyr Tyr Glu Lys 
595 600 605 

Val He Glu Asp Phe Asp Lys Asp Gin Ala Asp Tyr Glu Asp Val He 
610 615 620 



Glu He He Ser Asp Glu Ser Val Glu Glu Glu Gly He Glu Glu Gly 
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625 ' 630 635 640 

lie Gin Gin Asp Glu Asp lie Tyr Glu Glu Gly Asn Tyr Glu Glu Glu 
645 650 ^ 655 

Gly Ser Glu Asp Val Trp Glu Glu Gly Glu Asp Ser Asp Asp Ser Asp 
660 665 670 

Leu Glu Asp Val Leu Gin Val Pro Asn Gly Trp Ala Asn Pro Gly Lys 
675 680 * 685 

Arg Gly Lys Thr Gly 
690 

<210> 7 

<211> 2632 

<212> DNA 

<213> NM 016463 HSPC195 



i ) 

WO 2004/022778 



<220> 

<221> CDS 

<222> (931) . . (1611) 

<223> 



<400> 7 
ggcggctgag 


cctgagcggg gatgtagagg cggcggcagc agaggcggca ctggcggcaa 


60 


gagcagacgc 


ccgagccgag cgagaagagc ggcagagcct tatcccctga agccgggccc 


120 


cgcgtcccag 


ccctgcccag cccgcgccca gccatgcgcg ccgcctgctg agtccgggcg 


180 


ccgcacgctg 


agccctccgc- ccgcgagccg cgctcagctc gggggtgatt agttgctttt 


240 


tgttgttttt 


taatttgggc cgcggggagg gggaggaggg gcaggtgctg caggctcccc 


300 


cccctccccg 


cctcgggcca gccgcggcgg cgcgactcgg gctccggacc cgggcactgc 


360 


tggcggctgg 


agcggagcgc accgcggcgg tggtgcccag agcggagcgc agctccctgc 


420 


cccgcccctc 


cccctcggcc tcgcggcgac ggcggcggtg gcggcttgga cgactcggag 


480 


agccgagtga 


agacatttcc acctggacac ctgaccatgt gcctgccctg agcagcgagg 


540 


cccaccaggc 


atctctgttg tgggcagcag ggccaggtcc tggtctgtgg accctcggca 


600 


gttggcaggc 


tccctctgca gtggggtctg ggcctcggcc ccaccatgtc gagcctcggc 


660 


ggtggctccc 


aggatgccgg cggcagtagc agcagcagca ccaatggcag cggtggcagt 


720 


ggcagcagtg 


gcccaaaggc aggagcagca gacaagagtg cagtggtggc tgccgccgca 


780 


ccagcctcag 


tggcagatga cacaccaccc cccgagcgtc ggaacaagag cggtatcatc 


840 
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agtgagcccc tcaacaagag cctgcgccgc tcccgcccgc tctcccacta ctcttctttt 900 

ggcagcagtg gtggtagtgg cggtggcagc atg atg ggc gga gag tct get gac 954 

Met Met 'Gly Gly Glu Ser Ala Asp 

1 '5 



aag gec act gcg get gca gec get. gec tec ctg ttg gec aat ggg cat 
Lys Ala Thr Ala Ala Ala Ala Ala Ala Ser Leu Leu Ala Asn Gly His 
10 15 20 



1002 



gac ctg gcg gcg gec atg gcg gtg gac aaa age aac cct acc tea aag 1050 
Asp Leu Ala Ala Ala Met Ala Val Asp Lys Ser Asn Pro Thr Ser Lys 
25 30 35 40 

cac aaa agt ggt get gtg gec age ctg ctg age aag gca gag egg gec 109.8 
His Lys Ser Gly Ala Val Ala Ser Leu Leu Ser Lys Ala Glu Arg Ala 
45 50 55 

acg gag ctg gca gee gag gga cag ctg acg ctg cag cag ttt gcg cag 1146 
Thr Glu Leu Ala Ala Glu Gly Gin Leu Thr Leu Gin Gin Phe Ala Gin 
60 65 -.70 

tec aca gag atg ctg aag'cgc gtg gtg cag gag cat etc ccg ctg atg 1194 
Ser Thr Glu Met Leu Lys Arg Val Val Gin Glu His Leu Pro Leu Met 
75 80 85 

age gag gcg ggt get ggc ctg cct gac atg gag get gtg gca ggt gec 1242 
Ser Glu Ala Gly Ala Gly Leu Pro Asp Met Glu Ala Val Ala Gly Ala 
90 95 100 

gaa gee etc aat ggc cag tec gac ttc ccc tac ctg ggc get ttc ccc 1290 
Glu Ala Leu Asn Gly Gin Ser Asp Phe Pro Tyr Leu Gly Ala Phe Pro 
105 no us ~ 120 

ate aac cca ggc etc ttc att atg acc ccg gca ggt gtg ttc ctg gec 1338 
lie Asn Pro Gly Leu Phe He Met Thr Pro Ala Gly Val Phe Leu Ala 
125 130 ~ 135 

gag age gcg ctg cac atg gcg ggc ctg get gag tac ccc atg cag gga 1386 
Glu Ser Ala Leu His Met Ala Gly Leu Ala Glu Tyr Pro Met Gin Gly 
140 145 150 

gag ctg gec tct gec ate age tec ggc aag aag aag egg aaa cgc tgc 1434 
Glu Leu Ala Ser Ala He Ser Ser Gly Lys Lys Lys Arg Lys Arg Cys 
155 160 165 

ggc atg tgc gcg ccc tgc egg egg cgc ate aac tgc gag cag tgc age 1482 
Gly Met Cys Ala Pro Cys Arg Arg Arg .He Asn Cys Glu Gin Cys Ser 
170 175 180 

agt tgt agg aat cga aag act ggc cat cag att tgc aaa ttc aga aaa 1530 
Ser Cys Arg Asn Arg Lys Thr Gly His Gin He Cys Lys Phe Arg Lys 
185 190 195" 200 

"tgt gag gaa etc aaa aag aag cct tec get get ctg gag aag gtg atg 1578 
Cys Glu Glu Leu Lys Lys Lys Pro Ser Ala Ala Leu Glu Lys Val Met 
205 210 215 

ctt ccg acg gga gee gee ttc egg tgg ttt cag tgaeggegge ggaacccaaa 1631 
Leu Pro Thr Gly Ala Ala Phe Arg Trp Phe Gin 
220 225 

gctgccctct ccgtgcaatg tcactgctcg tgtggtctcc agcaagggat tegggegaag 1691 

acaaaeggat gcacccgtct ttagaaccaa aaatattctc tcacagattt cattcctgtt 1751 

tttatatata tattttttgt tgtcgtttta acatctccac gtccctagca taaaaagaaa 1811 
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aagaaaiaaaa 


tttaaactgc tttttcggaa gaacaacaac aaaaaagagg taaagacgaa 


1871 


tctataaagt 


accgagactt cctgggcaaa gaatggacaa tcagtttcct tcctgtgtcg 


1931 


atgtcgatgt 


tgtctgtgca ggagatgcag tttttgtgta gagaatgtaa' attttctgta 


1991 


accttttgaa 


atctagttac taataagcac tactgtaatt tagcacagtt taactccacc 


2051 


ctcatttaaa 


cttcctttga ttctttccga ccatgaaata gtgcatagtt tgcctggaga 


2111 


atccactcac 


gttcataaag agaatgttga tggcgccgtg tagaagccgc tctgtatcca 


2171 


tccacgcgtg 


cagagctgcc agcagggagc tcacagaagg ggagggagca ccaggccagc 


2231 


tgagctgcac 


ccacagtccc gagactggga tcccccaccc caacagtgat tttggaaaaa 


2291 


aaaatgaaag 


ttctgttcgt ttatccattg cgatctgggg agccccatct cgatatttcc 


2351 


aatcctggct 


acttttctta gagaaaataa gtcctttttt tctggccttg ctaatggcaa 


2411 


cagaagaaag 


ggcttctttg cgtggtcccc tgctggtggg ggtgggtccc cagggggccc 


2471 


cctgcggcct 


gggcccccct gcccacggcc agcttcctgc tgatgaacat gctgtttgta 


2531 


ttgttttagg 


aaaccaggct gttttgtgaa taaaacgaat gcatgtttgt gtcacgaaaa 


2591 


aaaaaaaaaa 


aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa a 


2632 



<210> 8 

<211> 227 
<212> PRT 

<213> NM_016463 HSPC195 
<400> 8 

Met Met Gly Gly Glu Ser Ala Asp Lys Ala Thr Ala Ala Ala Ala Ala 
1 5 10 15 

Ala Ser Leu Leu Ala Asn Gly His Asp Leu Ala Ala Ala Met Ala Val 
20 . 25 30 

Asp Lys Ser Asn Pro Thr Ser Lys His Lys Ser Gly Ala Val Ala Ser 
35 40 45 

Leu Leu Ser Lys Ala Glu Arg Ala Thr Glu Leu Ala Ala Glu Gly Gin 
50 55 60 

Leu Thr Leu Gin Gin Phe Ala Gin Ser Thr Glu Met Leu Lys Arg Val 
65 70 75 80 

Val Gin Glu His Leu Pro Leu Met Ser Glu Ala Gly Ala Gly Leu Pro 
85 90 95 
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Asp Met Glu Ala Val Ala Gly Ala Glu Ala Leu Asn Gly Gin Ser Asp 
100 105 110 
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Phe Pro Tyr Leu Gly Ala Phe Pro He Asn Pro Gly Leu Phe He Met 
115 120 125 



Thr Pro Ala Gly Val Phe Leu Ala Glu Ser Ala Leu His Met Ala Gly 
130 135 140 



Leu Ala Glu Tyr Pro Met Gin Gly Glu Leu Ala Ser Ala He Ser Ser 
145 150 155 160 



.Gly Lys Lys Lys Arg Lys Arg Cys Gly Met Cys Ala Pro Cys Arg Arg 
165 170 175 



Arg He Asn Cys Glu Gin Cys Ser Ser Cys Arg Asn Arg Lys Thr Gly 
180 185 190 



His Gin He Cys Lys Phe Arg Lys Cys Glu Glu Leu Lys Lys Lys Pro 
195 200 205 



Ser Ala Ala Leu Glu Lys Val Met Leu -Pro Thr Gly Ala Ala Phe Arg 
210 .215 220 

Trp Phe Gin 

225. 

<210> 9 
<211> 2140 
<212> DNA 

<213> NM 017697 FLJ20171 



<220> 

<221> CDS 

<222> (58) (1131) 

<223> 



<400> 9 

gaattcaaga aatgttgccc tggttcacct gatattgaca aactggacgt tgccaca 57 

atg aca gag tat tta aat ttt gag aag agt agt tea gtc tct cga tat 105 
Met Thr Glu Tyr Leu Asn Phe Glu Lys Ser Ser Ser Val Ser Arg Tyr 
1 5 10 15 

gga gec tct caa gtt gaa gat atg ggg aat ata att tta gca atg att 153 
Gly Ala Ser Gin Val Glu Asp Met Gly Asn He He Leu Ala Met He 
20 25 30 

tea gag cct tat aat cac agg ttt tea gat cca gag aga gtg aat tac 201 
Ser Glu Pro Tyr Asn His Arg Phe Ser Asp Pro Glu Arg Val Asn Tyr 
35 40 45 
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aag ttt gaa agt gga act tgc age aag atg gaa ctt att gat gat aac 249 
Lys Phe Glu Ser Gly Thr Cys Ser Lys Met Glu Leu lie Asp Asp Asn 
50 55 60 

acc gta gtc agg gca cga ggt tta cca tgg cag tct tea gat caa gat 297 
Thr Val Val Arg Ala Arg Gly Leu Pro Trp Gin Ser Ser Asp Gin Asp 

65 70 75 " 80 

att gca aga ttc ttc aaa gga etc aat att gec aag gga ggt gca gca 345 

lie Ala Arg Phe Phe Lys Gly Leu Asn He Ala Lys Gly Gly Ala Ala 
.85 • 90 95 

ctt tgt ctg aat get cag ggt cga agg aac gga gaa get ctg gtt agg 393 

Leu Cys Leu Asn Ala Gin Gly Arg Arg Asn Gly Glu Ala Leu Val Arg 
100 105 110 

ttt gta agt gag gag cac cga gac eta gca eta cag agg cac aaa cat 441 

Phe Val Ser Glu Glu His Arg Asp Leu Ala Leu Gin Arg His Lys His 
115 120 125 

cac atg ggg acc egg tat att gag gtt tac aaa gca aca ggt gaa gat 489 

His Met Gly Thr Arg Tyr lie Glu Val Tyr Lys Ala Thr Gly Glu Asp 
130 135 140 

ttc ctt aaa att get ggt ggt act tec aat gag gta gee cag ttt etc 537 

Phe Leu Lys He Ala Gly Gly Thr Ser Asn Glu Val Ala Gin Phe Leu 

145 150 155 160 

tec aag gaa aat caa gtc att gtt cgc atg egg ggg etc cct ttc acg 585 

Ser Lys Glu Asn Gin Val He Val Arg Met Arg Gly Leu Pro Phe Thr 
165 170 175 

gee aca get gaa gaa gtg gtg gee ttc ttt gga cag cat tgc cct att 633 

Ala Thr Ala Glu Glu Val Val Ala Phe Phe Gly Gin His Cys Pro He 
180 185 190 

act ggg gga aag gaa ggc ate etc ttt gtc acc tac cca gat ggt agg 681 

Thr Gly Gly Lys Glu Gly He Leu Phe Val Thr Tyr Pro Asp Gly Arg 
195 200 205 

cca aca ggg gac get ttt gtc etc ttt gee tgt gag gaa tat gca cag 729 

Pro Thr Gly Asp Ala Phe Val Leu Phe Ala Cys Glu Glu Tyr Ala Gin 
210 215 220 

aat gcg ttg agg aag cat aaa gac ttg ttg ggt aaa aga tac att gaa 777 

Asn Ala Leu Arg Lys His Lys Asp Leu Leu Gly Lys Arg Tyr He Glu 

225 230 235 " 240 

etc ttc agg age aca gca get gaa gtt cag cag gtg ctg aat cga ttc 825 

Leu Phe Arg Ser Thr Ala Ala Glu Val Gin Gin Val Leu Asn Arg Phe 
245 250 255 

tec teg gee cct etc att cca ctt cca acc cct ccc att att cca gta 873 

Ser Ser Ala Pro Leu He Pro Leu Pro Thr Pro Pro He He Pro Val 
260 265 270 

eta cct cag caa ttt gtg ccc cct aca aat gtt aga gac tgt ata cgc 921 

Leu Pro Gin Gin Phe Val Pro Pro Thr Ash Val Arg Asp Cys He Arg 
275 280 285 

ctt cga ggt ctt ccc tat gca gee aca att gag gac ate ctg gat ttc 969 

Leu Arg Gly Leu Pro Tyr Ala Ala Thr He Glu Asp He Leu Asp Phe 
290 295 300 

ctg ggg gag ttc gee aca gat att cgt act cat ggg gtt cac atg gtt 1017 

Leu Gly Glu Phe Ala Thr Asp He Arg Thr His Gly Val His Met Val 

305 310 315 320 
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ttg aat cac cag ggc cgc cca tea gga gat gec ttt ate cag atg aag 1065 

Leu Asn His Gin Gly Arg Pro Ser Gly Asp Ala Phe lie Gin Met Lys 
325 330 . 335 

tct gcg gac aga gca ttt atg get gca cag aag tgt cat aaa aaa aaa 1113 

Ser Ala Asp Arg Ala Phe Met Ala Ala Gin Lys Cys His Lys Lys Lys 
340 345 . 350 

cat gaa gga cag ata tgt tgaagtcttt cagtgttcag ctgaggagat 1161 
His Glu Gly Gin lie Cys ' 
355 



gaactttgtg 


*» *- aa *-y y y y y 


yuaULLUafla u^yactciLyy^ UtaL^LCCaC cgCCaL.yt.aa 


i 091 

1ZZ1 


gttaccatgc 


v- vy w c 


utLLLLataL dLLLUtaytL LLL^ CigCag CtaLCCCCaC 






all toCtagc 


cctctgtgat tttgaatcca cgagcactgc agccctccac 


1341 


agegea ctae 


ccagcaggca 


ctcagctctt catgaactac acagegtact atcccagccc 


1401 


cccaggttcg 


cctaatagtc 


ttggctactt ccctacagct gctaatctta gcggtgtccc 


1461 


tccacagcct 


ggcacggtgg 


teagaatgea gggcctggcc tacaatactg gagttaagga 


1521 


aattcttaac 


ttcttccaag 


gttaccagtg tttgaaagat gtatggtgat cttgaaacct 


1581 


ccagacacaa 


gaaaacttct 


agcaaattca ggggaagttt gtctacactc aggctgeagt 


1641 


attttcagca 


aacttgattg 


gaeaaaeggg cctgtgcctt atcttttggt ggagtgaaaa 


1701 


agtttgagct 


agtgaagcca 


aategtaact tacagcaagc ageatgeage' atacctggct 


1761 


etttgetgat 


tgcaaatagg 


catttaaaat gtgaatttgg aatcagatgt ctccattact 


1821 


tccagttaaa 


gtggcatcat 


aggtgtttcc taagttttaa gtcttggata aaaactccac 


1881 


cagtgtctac 


catctccacc 


atgaactctg ttaaggaagc ttcatttttg tatattcccg 


1941 


ctcttttctc 


ttcatttccc 


tgtcttctgc -ataatcatgc ettcttgeta agtaattcaa 


2001 


gcataagatc 


ttggaataat 


aaaatcacaa tcttaggaga aagaataaaa ttgttatttt 


2061 


cccagtctct 


tggecatgat 


gatatcttat gattaaaaac aaattaaatt ttaaaacacc 


2121 


tgaaaaaaaa 


aaaaaaaaa 




2140 



<210> 10 

<211> 358 

<212> PRT 

<213> NM_017697 FLJ20171 

<400> 10 

Met Thr Glu Tyr Leu Asn Phe Glu Lys Ser Ser Ser Val Ser Arg Tyr 
1 5 10 . 15 



Gly Ala Ser Gin Val Glu Asp Met Gly Asn He He Leu Ala Met He 
20 25 30 
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Ser Glu Pro Tyr Asn His Arg Phe Ser Asp Pro Glu Arg Val Asn Tyr 
.35 40 . 45 



Lys Phe Glu Ser Gly Thr Cys Ser Lys Met Glu Leu He Asp Asp Asn 
50 55 60 



Thr Val Val Arg Ala Arg Gly Leu Pro Trp Gin Ser Ser Asp Gin Asp 
65 70 75 80 



He Ala Arg Phe Phe Lys Gly Leu Asn He Ala Lys Gly Gly Ala Ala 
85 90 . "* 95 



Leu Cys Leu Asn Ala Gin Gly Arg Arg Asn Gly Glu Ala Leu Val Arg 
100 . 105 - 110 



Phe Val Ser Glu Glu His Arg Asp Leu Ala Leu Gin Arg His Lys His 
115 120 125 



His Met Gly Thr Arg Tyr He Glu Val Tyr Lys Ala Thr Gly Glu Asp 
130 135 140 



Phe Leu Lys He Ala Gly Gly Thr Ser Asn Glu Val Ala Gin Phe Leu 
145 150 155 160 



Ser Lys Glu Asn Gin Val He Val Arg Met Arg Gly Leu Pro Phe Thr 
165 170 175 



Ala Thr Ala Glu Glu Val Val Ala Phe Phe Gly Gin His Cys Pro He 
180 185 ' 190 



Thr Gly Gly Lys Glu Gly He Leu Phe Val Thr Tyr Pro Asp Gly Arg 
195 200 205 



Pro Thr Gly Asp Ala Phe Val Leu Phe Ala Cys Glu Glu Tyr Ala Gin 
210 215 220 



Asn Ala Leu Arg Lys His Lys Asp Leu Leu Gly Lys Arg Tyr He Glu 
225 230 235 240 



Leu Phe Arg Ser Thr Ala Ala Glu Val Gin Gin Val Leu Asn Arg Phe 
245 250 255 



Ser Ser Ala Pro Leu He Pro Leu Pro Thr Pro Pro He He Pro Val 
260 265 270 



Leu Pro Gin Gin Phe Val Pro Pro Thr Asn Val Arg Asp Cys He Arg 
275 280 285 



Leu Arg Gly Leu Pro Tyr Ala Ala Thr He Glu Asp He Leu Asp Phe 
290 295 300 
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Leu Gly Glu Phe Ala Thr Asp lie Arg Thf His Gly Val His Met Val 
305 310 315 320 



Leu Asn His Gin Gly Arg Pro Ser Gly Asp Ala Phe lie Gin Met Lys 
325 '330 335 



Ser Ala Asp Arg Ala Phe Met Ala Ala Gin Lys Cys His Lys Lys Lys 
340 345 350 



His Glu Gly Gin lie Cys 
355 



<210> 11 

<211> 2808 

<212> DNA 

<213> NM 052886 MAL2 



<220> 

<221> CDS 

<222> (80).. (607) 

<223> 



<400> 11 

ggcggcggcg gcaggagccc gggaggcgga ggcgggaggc ggcggcggcg cgcggagacg 60 

cagcagcggc agcggcagc atg teg gee ggc gga gcg tea gtc ccg ccg ccc 112 
Met Ser Ala Gly Gly Ala Ser Val Pro Pro Pro 
15 10 

ccg aac ccc gec gtg tec ttc ccg ccg ccc egg gtc acc ctg ccc gee 160 
Pro Asn Pro Ala Val Ser Phe Pro Pro Pro Arg Val Thr Leu Pro Ala 
15 20 25 

ggc ccc gac ate ctg egg acc tac teg ggc gec ttc gtc tgc ctg gag 208 
' Gly Pro Asp lie Leu Arg Thr Tyr Ser Gly Ala Phe Val Cys ' Leu Glu 
30 35 40 

att ctg ttc ggg ggt ctt gtc tgg att ttg gtt gec tec tec aat gtt 256 
He Leu Phe Gly Gly Leu Val Trp He Leu Val Ala Ser Ser Asn Val 
45 50 55 

cct eta cct eta eta caa gga tgg gtc atg ttt gtg tec gtg aca gcg 304 
Pro Leu Pro Leu Leu Gin Gly Trp Val Met Phe Val Ser Val Thr Ala 
60 65 70 75 

ttt ttc ttt teg etc etc ttt ctg ggc atg ttc etc tct ggc atg gtg 352 
Phe Phe Phe Ser Leu Leu Phe Leu Gly Met Phe Leu Ser Gly Met Val 
80 85 90 

get caa att gat get aac tgg aac ttc ctg gat ttt gec tac cat ttt 400 
Ala Gin He Asp Ala Asn Trp Asn Phe Leu Asp Phe Ala Tyr His Phe 
95 100 105 



WO 2004/022778 



30 



PCT/AU2003/001166 



aca gta ttt gtc ttc tat ttt gga gcc ttt tta ttg gaa gca'gca gcc 448 
Thr Val Phe Val Phe Tyr Phe Gly Ala Phe Levi Leu Glu Ala Ala Ala 
110 115 ' -120 

aca tec ctg cat gat ttg cat tgc aat aca acc ata acc ggg cag cca 496 
Thr Ser Leu His Asp Leu His Cys Asn Thr Thr He Thr Gly Gin Pro 
125 130 135 

etc ctg agt gat aac cag tat aac ata aac gta gca gcc tea att ttt 544 
Leu Leu Ser Asp Asn Gin Tyr Asn He Asn Val Ala Ala Ser He Phe 
140 145 150 155 

gcc ttt atg acg aca get tgt tat ggt tgc agt ttg ggt ctg get tta 592 
Ala Phe Met Thr Thr Ala Cys Tyr Gly Cys Ser Leu Gly Leu Ala Leu 
160 165 170 

" cga aga tgg cga ccg taacactcct tagaaactgg cagtegtatg ttagtttcac 647 
Arg Arg Trp Arg Pro 
175 



ttgtctactt 


tatatgtctg 


atcaatttgg 


ataccatttt 


gtccagatgc 


aaaaacattc 


707 


caaaagtaat 


gtgtttagta 


gagagagact 


ctaagctcaa 


gttctggttt 


atttcatgga 


767 


tggaatgtta 


attttattat 


gatattaaag 


aaatggcctt 


ttattttaca 


tctctcccct 


827 


ttttcccttt 


ccccctttat 


tttcctcctt 


ttctttctga 


aagtttcctt 


ttatgtccat 


887 


aaaatacaaa 


tatattgttc 


ataaaaaatt 


agtatccctt 


ttgtttggtt 


gctgagtcac 


947 


ctgaacctta 


attttaattg 


gtaattacag 


cccctaaaaa 


aaacacattt 


caaataggct 


1007 


tcccactaaa 


ctctatattt 


tagtgtaaac 


caggaattgg 


cacacttttt 


ttagaatggg 


1067 


ccagatggta 


aatatttatg 


cttcacggtc 


catacagtct 


ctgtcacaac 


tattcagttc 


1127 


tgctagtata 


gcgtgaaagc 


agctatacac 


aatacagaaa 


tgaatgagtg 


tggttatgtt 


1187 


ctaataaaac 


ttatttataa 


aaacaagggg 


aggctgggtt 


tagcctgtgg 


gccatagttt 


1247 


gtcaaccact 


ggtgtaaaac 


cttagttata 


tatgatctgc 


attttcttga 


actgatcatt 


1307 


gaaaacttat 


aaacctaaca 


gaaaagecac 


ataatattta 


gtgtcattat 


gcaataatca 


1367 


cattgecttt 


gtgttaatag 


tcaaatactt 


acctttggag 


aatacttacc 


tttggaggaa 


1427 


tgtataaaat 


ttctcaggca 


gagtcctgga 


tataggaaaa 


agtaatttat 


gaagtaaact 


1487 


teagttgett 


aatcaaacta 


atgatagtct 


aacaactgag 


caagatcctc 


atctgagagt 


1547 


gcttaaaatg 


ggatccccag 


agaccattaa 


ccaatactgg 


aactggtatc 


tagctactga 


1607 


tgtcttactt 


tgagtttatt 


tatgettcag 


aatacagttg 


tttgccctgt 


geatgaatat 


1667 


acccatattt 


gtgtgtggat 


atgtgaagct 


tttccaaata 


gagctctcag 


aagaattaag 


1727 


tttttacttc 


taattatttt 


gcattacttt 


gagttaaatt 


tgaatagagt 


attaaatata 


1787 


aagttgtaga 


ttcttatgtg 


tttttgtatt 


agcccagaca 


tctgtaatgt 


ttttgcactg 


1847 


gtgacagaca 


aaatctgttt 


taaaatcata 


tccagcacaa 


aaactatttc 


tggctgaata 


1907 


gcacagaaaa 


gtattttaac 


ctacctgtag 


agatcctcgt 


catggaaagg 


tgccaaactg 


1967 


ttttgaatgg 


aaggacaagt 


aagagtgagg 


ccacagttcc 


caccacacga 


gggcttttgt 


2027 


attgttctac 


tttttcagee 


ctttactttc 


tggctgaagc 


atccccttgg 


agtgccatgt 


2087 
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ataagttggg- 


ctattagagt 


31 

tcatggaaca tagaacaacc atgaatgagt ggcatgatcc 


2147 


gtgcttaatg 


atcaagtgtt 


acttatctaa taatcctcta gaaagaaccc tgttagatct 


2207 


tggtttgtga 


taaaaatata 


aagacagaag acatgaggaa aaacaaaagg tttgaggaaa 


2267 


tcaggcatat 


gactttatac 


ttaacatcag atcttttcta taatatccta ctactttggt 


2327 


tttcctagct 


ccataccaca 


cacctaaacc tgtattatga attacatatt acaaagtcat 


2387 


aaatgtgcca 


tatggatata 


cagtacattc tagttggaat cgtttactct gctagaattt 


2447 


aggtgtgaga 


ttttttgttt 


cccaggtata gcaggcttat gtttggtggc attaaattgg 


2507 


tttctttaaa 


atgctttggt 


ggcacttttg taaacagatt gcttctagat tgttacaaac 


2567 


caagcctaag 


acacatctgt 


gaatacttag atttgtagct taatcacatt ctagacttgt 


2627 


gagttgaatg 


acaaagcagt 


tgaacaaaaa ttatggcatt taagaattta acatgtctta 


2687 


gctgtaaaaa 


tgagaaagtg 


ttggttggtt ttaaaatctg gtaactccat gatgaaaaga 


2747 


aatttatttt 


atacgtgtta 


tgtctctaat aaagtattca tttgataaaa aaaaaaaaaa 


2807 


a 






2808 



<210> 12 

<211> 176 

<212> PRT 

<213> NM_052886 MAL2 

<400> 12 

Met Ser Ala Gly Gly Ala Ser Val Pro Pro Pro Pro Asn Pro Ala Val 
1 5 10 15 

Ser Phe Pro Pro Pro Arg Val Thr Leu Pro Ala Gly Pro Asp lie Leu 
20 25 30 

Arg Thr Tyr Ser Gly Ala Phe Val Cys Leu Glu lie Leu Phe Gly Gly 
35 40 45 

Leu Val Trp lie Leu Val Ala Ser Ser Asn Val Pro Leu Pro Leu Leu 
50 55 60 

Gin Gly Trp Val Met Phe Val- Ser Val Thr Ala Phe Phe Phe Ser Leu 
65 70 75 80 

Leu Phe Leu Gly Met Phe Leu Ser Gly Met Val Ala Gin lie Asp Ala 
85 90 95 

Asn Trp Asn Phe Leu Asp Phe Ala Tyr His' Phe Thr Val Phe Val Phe 
100 105 110 



Tyr Phe Gly Ala Phe Leu Leu Glu Ala Ala Ala Thr Ser Leu His Asp 



1 
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Leu His Cys Asn Thr Thr lie Thr Gly Gin Pro Leu Leu Ser Asp Asn 
130 135 140 



Gin Tyr Asn lie Asn Val Ala Ala Ser lie Phe Ala Phe Met Thr Thr 
145 150 155 160 



Ala Cys Tyr Gly Cys Ser Leu Gly Leu Ala Leu Arg Arg Trp Arg Pro 
165 170 175 



<210> 13 

<211> 4171 

<212> DNA 

<213> NM_015238 KIAA0869 f KIBRA 
<220> 

<221> CDS 

<222> (67).. (3405) 

<223> 



<400> 13 

ctggagccgc tgagcccccg ctgcggccgg gagctgcatg ggggagcgcc ggcagcgctt 60 

gggaag atg ccc egg ccg gag ctg ccc ctg ccg gag ggc tgg gag gag 108 
Met Pro Arg Pro Glu Leu Pro Leu Pro Glu Gly Trp Glu Glu 
1 5 10 

gcg cgc gac ttc gac ggc aag gtc tac tac ata gac cac acg aac cgc 156 
Ala Arg Asp Phe Asp Gly Lys Val Tyr Tyr lie Asp His Thr Asn Arg 
15 20 25 30 

acc acc age tgg ate gac ccg egg gac agg tac ace aaa ccg etc ace 204 
Thr Thr Ser Trp lie Asp Pro Arg Asp Arg Tyr Thr Lys Pro Leu Thr 
35 40 45 



ttt get gac tgc att agt gat gag ttg ccg eta gga tgg gaa gag gea 252 
Phe Ala Asp Cys He Ser Asp Glu Leu Pro Leu Gly Trp Glu Glu Ala 
50 55 60 

tat gac cca cag gtt gga gat tac ttc ata gac cac aac acc aaa acc 300 
Tyr Asp Pro Gin Val Gly Asp Tyr Phe He Asp His Asn Thr Lys Thr 
65 70 75 

act cag att gag gat cct cga gta caa tgg egg egg gag cag gaa cat 348 
Thr Gin He Glu Asp Pro Arg Val Gin Trp Arg Arg Glu Gin Glu His 
80 85 90 

atg ctg aag gat tac ctg gtg gtg gee cag gag get ctg agt gca caa 396 
Met Leu Lys Asp Tyr Leu Val Val Ala Gin Glu Ala Leu Ser Ala Gin 
95 100 105 110 

aag gag ate tac cag gtg aag cag cag cgc ctg gag ctt gca cag cag 444 
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Lys Glu lie Tyr Gin Val Lys Gin Gin Arg Leu Glu Leu Ala Gin Gin 

115 . 120 125 

gag tac cag caa ctg cat gcc gtc tgg gag cat aag ctg ggc tec cag 492 

Glu Tyr Gin Gin Leu His Ala Val Trp Glu His Lys Leu Gly Ser .Gin 
130 135 . 140 

gtc age ttg gtc tct ggt tea tea tec age tec aag tat gac cct gag 540 

Val Ser Leu Val Ser Gly Ser Ser Ser Ser Ser Lys Tyr Asp Pro Glu 
145 150 155 

ate ctg aaa get gaa att gcc act gca* aaa tec egg gtc aac aag ctg 588 

He Leu Lys Ala Glu He Ala Thr Ala Lys Ser Arg Val Asn Lys Leu 
160 165 170 



aag aga gag atg gtt cac etc cag cac gag ctg cag ttc aaa gag cgt 636 
Lys Arg Glu Met Val His Leu Gin His Glu Leu Gin Phe Lys Glu Arg 
175 180 185 190 

ggc ttt cag ace ctg aag aaa ate gat. aag aaa atg tct gat get cag 684 
Gly Phe Gin Thr Leu Lys Lys He Asp Lys Lys Met Ser Asp Ala Gin 
195 200 205 

ggc age tac aaa ctg gat gaa get cag get gtc ttg aga gaa aca aaa 732 
Gly Ser Tyr Lys Leu Asp Glu Ala Gin Ala Val Leu Arg Glu Thr Lys 
210 215 220 

' gcc ate aaa aag get att ace tgt ggg gaa aag gaa aag caa gat etc . 780 
Ala He Lys Lys Ala lie Thr Cys Gly Glu Lys Glu Lys Gin Asp Leu 
225 230 235 

att aag age ctt gcc atg ttg aag gac ggc' ttc cgc act gac agg ggg 828 
He Lys Ser Leu Ala Met Leu Lys Asp Gly Phe Arg Thr Asp Arg Gly 
240 245 250 

tct cac tea gac ctg tgg tec age age age tct ctg gag agt teg agt 876 
Ser His Ser Asp Leu Trp Ser Ser Ser Ser Ser Leu Glu Ser Ser Ser 
255 260 265 270 

ttc ccg eta ccg aaa cag tac ctg gat gtg age tec cag aca gac ate 924 
Phe Pro Leu Pro Lys Gin Tyr Leu Asp Val Ser Ser Gin Thr Asp He 
275 280 . 285 

teg gga age ttc ggc ate aac age aac aat cag ttg gca gag aag gtc 972 
Ser Gly Ser Phe Gly He Asn Ser Asn Asn Gin Leu Ala Glu Lys Val 
290 295 300 



aga ttg cgc ctt cga tat gaa gag get aag aga agg ate gcc aac ctg 1020 

Arg Leu Arg Leu Arg Tyr Glu Glu Ala Lys Arg Arg He Ala Asn Leu 
305 310 315 

aag ate cag ctg gcc aag ctt gac agt gag gcc tgg cct ggg gtg ctg 1068 
Lys He Gin Leu Ala Lys Leu Asp Ser Glu Ala Trp Pro Gly Val Leu 
320 325 330 

gac tea gag agg gac egg ctg ate ctt ate aac gag aag gag gag ctg 1116 

Asp Ser Glu Arg Asp Arg Leu He Leu He Asn Glu Lys Glu Glu Leu 
335 340 345 350 

ctg aag gag atg cgc ttc ate age ccc cgc aag tgg acc cag ggg gag 1164 

Leu Lys Glu Met Arg Phe He Ser Pro Arg Lys Trp Thr Gin Gly Glu 

355 360 . 365 

gtg gag cag ctg gag atg gcc egg aag egg ctg gaa aag gac ctg cag 1212 

Val Glu Gin Leu Glu Met Ala Arg Lys Arg Leu Glu Lys Asp Leu Gin 
370 375 380 
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gca gcc egg gac acc cag age aag g C g ctg acg- gag agg tta aag tta 
Ala Ala Arg Asp Thr Gin Ser Lys Ala Leu Thr Glu Arg Leu Lys Leu 
385 390 . • 395 

aac agt aag agg aac cag ctt gtg a ga gaa ctg gag gaa gcc acc egg 
Asn Ser Lys Arg Asn Gin Leu Val Arg Glu Leu Glu Glu Ala Thr Arg 
400 405 410 

cag gtg gca act ctg cac tec cag ctg aaa agt etc tea age age atg 
Gin Val Ala Thr Leu His Ser Gin Leu. Lys. Ser Leu Ser Ser Ser Met 
415 420 ~ .425 430 



cag tec ctg tec tea ggc age age ccc gga tec etc acg tec age egg 
Gin Ser Leu Ser Ser Gly Ser Ser Pro Gly Ser Leu Thr Ser Ser Arg 
435 440 445 

ggc tec ctg gtt gca tec age ctg gac tec tec act tea gcc age ttc 
Gly Ser Leu Val Ala Ser Ser Leu Asp Ser Ser Thr Ser Ala Ser Phe 
450 455 . , 460 

act gac etc tac tat gac ccc ttt gag cag ctg. gac tea gag ctg cag 
Thr Asp Leu Tyr Tyr Asp Pro Phe Glu Gin Leu Asp Ser Glu Leu Gin 
465 470 475 

age aag gtg gag ttc ctg etc ctg gag ggg gcc acc ggc ttc egg ccc 
Ser Lys Val Glu Phe Leu Leu Leu Glu Gly Ala Thr Gly Phe Arg Pro 
480 485 490 . 

tea ggc tgc ate acc acc ate cac gag gat gag gtg gcc aag acc cag 
Ser Gly Cys He Thr Thr He His Glu Asp Glu Val Ala Lys Thr Gin 
495 500 505 510 

aag gca gag gga ggt ggc cgc ctg cag get ctg cgt tec ctg tct ggc 
Lys Ala Glu Gly Gly Gly Arg Leu Gin Ala Leu Arg Ser Leu Ser Gly 
515 520 525 

acc cca aag tec atg acc tec eta tec' cca cgt tec tct etc tec tec 
Thr Pro Lys Ser Met Thr Ser Leu Ser Pro Arg Ser Ser Leu Ser Ser 
530 535 540 

ccc tec cca ccc tgt tec cct etc atg - get gac ccc etc ctg get ggt 
Pro Ser Pro Pro Cys Ser Pro Leu Met Ala Asp Pro Leu Leu Ala Gly 
545 550 555 

gat gcc ttc etc aac tec ttg gag ttt gaa gac ccg gag ctg agt gcc 
Asp Ala Phe Leu Asn Ser Leu Glu Phe Glu Asp Pro Glu Leu Ser Ala 
560 565 570 

act ctt tgt gaa ctg age ctt ggt aac age gcc cag gaa aga tac egg 
Thr Leu Cys Glu Leu Ser Leu Gly Asn Ser Ala Gin Glu Arg Tyr Arg 
575 580 585 590 

ctg gag gaa cca gga acg gag ggc aag cag ctg ggc caa get gtg aat 
Leu Glu Glu Pro Gly Thr Glu Gly Lys Gin Leu Gly Gin Ala Val Asn 
595 600 605 

acg gcc cag ggg tgt ggc ctg aaa gtg gcc tgt gtc tea gcc gcc gta 
Thr Ala Gin Gly Cys Gly Leu Lys Val Ala Cys Val Ser Ala Ala Val 
610 615 620 

teg gac gag tea gtg get gga gac agt ggt gtg tac gag get tec gtg 
Ser Asp Glu Ser Val Ala Gly Asp Ser Gly Val Tyr Glu Ala Ser Val 
625 630 635 

cag aga ctg ggt get tea gaa get get gca ttt gac agt gac gaa teg 



1260 
1308 
1356 

1404 
1452 
1500 
1548 
1596 
1644 
1692 
1740 

1788 
1836 
1884 
1932 
1980 
2028 
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Gin Arg Leu Gly Ala Ser Glu Ala Ala Ala Phe Asp Ser Asp Glu Ser 
640 645 650 

gaa gca gtg ggt gcg acc cga att cag att gcc ctg aag tat gat gag 2076 
Glu Ala Val Gly Ala Thr Arg lie Gin He Ala Leu Lys Tyr Asp Glu 
655 660 ' 665 670 

aag aat aag caa ttt gca ata tta ate ate cag ctg agt aac ctt tct 2124 
Lys Asn Lys Gin Phe Ala He Leu He He Gin Leu Ser Asn Leu Ser 
675 680 685 



get ctg ttg cag caa caa gac cag aaa gtg aat ate cgc gtg get gtc 2172 
Ala Leu Leu Gin Gin Gin Asp Gin Lys Val Asn He Arg Val Ala Val 
690 695 700 

ctt cct tgc tct gaa age aca acc tgc ctg ttc egg acc egg cct ctg 2220 
Leu Pro Cys Ser Glu Ser Thr Thr Cys Leu Phe Arg Thr Arg Pro Leu 
705 710 715 

gac gcc tea gac act eta gtg ttc aat gag gtg ttc tgg gta tec atg 2268 
Asp Ala Ser Asp Thr Leu Val Phe Asn Glu Val Phe Trp Val Ser Met 
720 725 730 

tec tat cca gcc ctt cac cag aag acc tta aga gtc gat gtc tgt acc 2316 
Ser Tyr Pro Ala Leu His Gin Lys Thr Leu Arg Val Asp Val Cys Thr 
735 740 745 750 

acc gac agg age cat ctg gaa gag tgc ctg gga ggc gcc cag ate age 2364 
Thr Asp Arg Ser His Leu Glu Glu Cys Leu Gly Gly Ala Gin He Ser 
755 760 765 

ctg gcg gag gtc tgc egg tct ggg gag agg teg act cgc tgg tac aac 2412 
Leu Ala Glu Val Cys Arg Ser Gly Glu Arg Ser Thr Arg Trp Tyr Asn 
770 775 780 

ctt etc age tac aaa tac ttg aag aag cag age agg gag etc aag cca 2460 
Leu Leu Ser Tyr Lys Tyr Leu Lys Lys Gin Ser Arg Glu Leu Lys Pro 
785 790 795 

gtg gga gtt atg gcc cct gcc tea ggg cct gcc age acg gac get gtg 2508 
Val Gly Val Met Ala Pro Ala Ser Gly Pro Ala Ser Thr Asp Ala Val 
800 805 810 



tct get ctg ttg gaa cag aca gca gtg gag ctg gag aag agg cag gag 2556 
Ser Ala Leu Leu Glu' Gin Thr Ala Val Glu Leu Glu Lys Arg Gin Glu 
8*5 820 825 ' 830 

ggc agg age age aca cag aca ctg gaa gac age tgg agg tat gag gag 2604 
Gly Arg Ser Ser Thr Gin Thr Leu Glu Asp Ser Trp Arg Tyr Glu Glu 
835 840 ~ 845 

acc agt gag aat gag gca gta gcc gag gaa gag gag gag gag gtg gag 2652 
Thr Ser Glu Asn Glu Ala Val Ala Glu Glu Glu Glu Glu Glu Val Glu 
850 855 860 

gag gag gag gga gaa gag gat gtt ttc acc gag aaa gcc tea cct gat 2700 
Glu Glu Glu Gly Glu Glu Asp Val Phe Thr Glu Lys Ala Ser Pro Asp 
865 870 875 

atg gat ggg tac cca gca tta aag gtg gac aaa gag acc aac acg gag 2748 
Met Asp Gly Tyr Pro Ala Leu Lys Val Asp Lys Glu Thr Asn Thr Glu 
880 885 890 

acc ccg gcc cca tec ccc aca gtg gtg cga cct aag gac egg aga gtg 2796 
Thr Pro Ala Pro Ser Pro Thr Val Val Arg Pro Lys Asp Arg Arg Val 
895 900 905 * 910 
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ggc acc ccg tec cag ggg cca ttt ctt cga ggg age acc ate ate cgc 2844 
Gly Thr Pro Ser Gin Gly Pro Phe Leu Arg Gly Ser Thr He He Arg 
915 . 920 925 

tct aag acc ttc tec cca gga ccc cag age cag tac gtg tgc egg ctg 2892 
Ser Lys Thr Phe Ser Pro Gly Pro Gin Ser Gin Tyr Val Cys Arg Leu 
930 935 " 940 



aat egg agt gat agt gac age tec act ctg tec aaa aag cca cct ttt 2940 
Asn Arg Ser Asp Ser Asp Ser Ser Thr Leu Ser Lys' Lys Pro Pro Phe 
945 950 955 

' gtt cga aac tec ctg gag cga cgc age gtc egg atg aag egg cct tec 2988 
Val Arg Asn Ser Leu Glu Arg Arg Ser Val Arg Met Lys Arg Pro Ser 
960 965 970 

teg gtc aag teg ctg cgc tec gag cgt ctg ate cgt acc teg ctg gac 3036 
Ser Val Lys Ser Leu Arg Ser Glu Arg Leu He Arg Thr Ser Leu Asp 
975 980 985 990 

ctg gag tta gac ctg cag gcg aca aga acc tgg cac age caa ctg acc 3084 
Leu Glu Leu Asp Leu Gin Ala Thr Arg Thr Trp His Ser Gin Leu Thr 
995 1000 • 1005 

cag gag ate teg gtg ctg aag gag etc aag gag cag ctg gaa caa 3129 
Gin Glu He Ser Val Leu Lys Glu Leu- Lys Glu Gin Leu Glu Gin 
1010 1015 1020 

gee aag age cac ggg gag aag gag ctg cca cag tgg ttg cgt gag 3174 
Ala Lys Ser His Gly Glu Lys Glu Leu Pro Gin Trp Leu Arg Glu 
1025 1030 1035 

gac gag cgt ttc cgc ctg ctg ctg agg atg ctg gag aag egg cag 3219 
Asp Glu Arg Phe Arg Leu Leu Leu Arg Met Leu Glu Lys Arg Gin 
1040 1045 1050 

atg gac cga gcg gag cac aag ggt gag ctt cag aca gac aag atg 3264 
Met Asp Arg Ala Glu His Lys Gly Glu Leu Gin Thr Asp Lys Met 
1055 1060 1065 



atg agg gca get gee aag gat gtg cac agg etc cga ggc cag age 3309 
Met Arg Ala Ala Ala Lys Asp Val His Arg Leu Arg Gly Gin Ser 
1070 1075 , 1080 

tgt aag gaa ccc cca gaa gtt cag tct' ttc agg gag aag atg gca 3354 
Cys Lys Glu fro Pro Glu Val Gin Ser Phe Arg Glu Lys Met Ala 
1085 1090 1095 

ttt ttc acc egg cct egg atg aat ate cca get etc tct gca gat 3399 
Phe Phe Thr Arg Pro Arg Met Asn He Pro Ala Leu Ser Ala Asp 
1100 1105 • 1110 

gac gtc taatcgccag aaaagtattt cctttgttcc actgaccagg ctgtgaacat 3455 
Asp Val 

tgactgtggc taaagttatt tatgtggtgt tatatgaagg tactgagtca caagtcctct 3515 

agtgctcttg ttggtttgaa gatgaaccga ctttttagtt tgggtcctac tgttgttatt 3575 

aaaaacagaa caaaaacaaa acacacacac acacaaaaac agaaacaaaa aaaaccagca 3635 

ttaaaataat aagattgtat agtttgtata tttaggagtg tatttttggg aaagaaaatt 3695 

taaatgaact aaagcagtat tgagttgctg ctcttcttaa aatcgtttag attttttttg 3755 
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gtttgtacag 


ctccaccttt 


t aa aa cr t c 1 1 




ctgggggacg 


■301 C 


gtaatcctaa 


taggacgtcc 


cgcacttgtc 


acagtacagc taatttttcc 


tagttaacat 


3875 


attttgtaca 


atattaaaaa 


aatgcacaga 


aaccattggg ggggattcag 


aggtgeatec 


3935 


acggatcttc 


ttgagctgtg 


acgtgttttt 


atgtggctgc ccaacgtgga 


gcgggcagtg 


3995 


tgataggctg 


ggtgggctaa 


gcagcctagt 


ctatgtgggt gacaggccac 


gctggtctca 


• 4055 


gatgcccagt 


gaagccacta 


acatgagtga 


ggggagggct gtggggaact 


ccattcagtt 


4115 


ttatctccat 


caataaagtg 


gcctttcaaa 


aagaaaaaaa aaaaaaaaaa 


aaaaaa 


4171 



<210> 14 

<211> 1113 

<212> PRT 

<213> NMJU5238 KIAA0869, KIBRA 

<400> 14 

Met Pro Arg Pro Glu Leu Pro Leu Pro Glu Gly Trp Glu Glu Ala Arg 
1 5 10 " " 15 



Asp Phe Asp Gly Lys Val Tyr Tyr lie Asp His Thr Asn Arg Thr Thr 
20 25 30 



Ser Trp lie Asp Pro Arg Asp Arg Tyr Thr Lys Pro Leu Thr Phe Ala 
35 40 45 



Asp Cys lie Ser Asp Glu Leu Pro Leu Gly Trp Glu Glu Ala Tyr Asp 
50 55 60. 



Pro Gin Val Gly Asp Tyr Phe He Asp His Asn Thr Lys Thr Thr Gin 
65 70 75 * 80 



He Glu Asp Pro Arg Val Gin Trp Arg Arg Glu Gin Glu His Met Leu 
85 90 95 



Lys Asp Tyr Leu Val Val Ala Gin Glu Ala Leu Ser Ala Gin Lys Glu 
100 105 HO 



He Tyr Gin Val Lys Gin Gin Arg Leu Glu Leu Ala Gin Gin Glu Tyr 
115 120 125 



Gin Gin Leu His Ala Val Trp Glu His Lys Leu Gly Ser Gin Val Ser 
130 135 140 



Leu Val Ser Gly Ser Ser Ser Ser Ser Lys Tyr Asp Pro Glu He Leu 
145 150 155 160 
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Lys Ala Glu He Ala Thr Ala Lys Ser. Arg Val Asn Lys Leu Lys Arg 
165 170 175 



Glu Met Val His Leu Gin His Glu Leu Gin Phe Lys Glu Arg Gly Phe 
180 185 190 



Gin- Thr Leu Lys Lys He Asp Lys Lys Met Ser Asp Ala Gin Gly Ser 
195 200 205 



Tyr Lys Leu Asp Glu Ala Gin Ala Val Leu Arg Glu Thr Lys Ala He 
210 215 220 



Lys Lys Ala He Thr Cys Gly Glu Lys Glu Lys Gin Asp Leu He Lys 
225 230 235 240 



Ser Leu Ala Met Leu Lys Asp Gly Phe Arg Thr Asp Arg Gly Ser His 
245 250 . 255 



Ser Asp Leu Trp Ser Ser Ser Ser Ser Leu Glu Ser Ser Ser Phe Pro 
260 265 270 



Leu Pro Lys Gin Tyr Leu Asp Val Ser Ser Gin Thr Asp He Ser Gly 
275 280 285 



Ser Phe Gly He Asn Ser Asn Asn Gin Leu Ala Glu Lys Val Arg Leu 
290 295 300 



Arg Leu Arg Tyr Glu Glu Ala Lys Arg Arg He Ala Asn Leu Lys He 
305 310 315 320 



Gin Leu Ala Lys Leu Asp Ser Glu Ala Trp Pro Gly Val Leu Asp Ser 
325 . 330 335 



Glu Arg Asp Arg Leu He Leu He Asn Glu Lys Glu Glu Leu Leu Lys 
340 345 350 



Glu Met Arg Phe He Ser Pro Arg Lys frp Thr Gin Gly Glu Val Glu 
355 360 365 



Gin Leu Glu Met Ala Arg Lys Arg Leu Glu Lys Asp Leu Gin Ala Ala 
370 375 380 



Arg Asp Thr Gin Ser Lys Ala Leu Thr Glu Arg Leu Lys Leu Asn Ser 
385 390 395 400 



Lys Arg Asn Gin Leu Val Arg Glu Leu Glu Glu Ala Thr Arg Gin Val 
405 410 415 



Ala Thr Leu His Ser Gin Leu Lys Ser Leu Ser Ser Ser Met Gin Ser 
420 425 430 
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Leu Ser Ser Gly Ser Ser Pro Gly Ser Leu Thr Ser Ser Arg Gly Ser 
435 440 445 



Leu Val Ala Ser Ser Leu Asp Ser Ser Thr Ser Ala Ser Phe Thr Asp 
450 455 460 



Leu Tyr Tyr Asp Pro Phe Glu Gin Leu Asp Ser Glu Leu Gin Ser Lys 
465 470 475 480 



Val Glu Phe Leu Leu Leu Glu Gly Ala Thr Gly Phe Arg Pro Ser Gly 
485 490 495 



Cys lie Thr Thr He His Glu Asp Glu Val Ala Lys Thr Gin Lys Ala 
500 505 510 



Glu Gly Gly Gly Arg Leu Gin Ala Leu Arg Ser Leu Ser Gly Thr Pro 
515 520 525 



Lys Ser Met Thr Ser Leu Ser Pro Arg Ser Ser Leu Ser Ser Pro Ser 
530 535 540 



Pro Pro Cys Ser Pro Leu Met Ala Asp Pro Leu Leu Ala Gly Asp Ala 
545 550 555 " 560 . 



Phe Leu Asn Ser Leu Glu Phe Glu Asp Pro Glu Leu Ser Ala Thr Leu 
565 570 575 



Cys Glu Leu Ser Leu Gly Asn Ser Ala Gin Glu Arg Tyr Arg Leu Glu 
580 585 590 



Glu Pro Gly Thr Glu Gly Lys Gin Leu Gly Gin Ala Val Asn Thr Ala 
595 600 605 



Gin Gly Cys Gly Leu Lys Val Ala Cys Val Ser Ala Ala Val Ser Asp 
610 615 620 



Glu Ser Val Ala Gly Asp Ser Gly Val Tyr Glu Ala Ser Val Gin Arg 
625 630 635 640 



Leu Gly Ala Ser Glu Ala Ala Ala Phe Asp Ser Asp Glu Ser Glu Ala 
645 650 655 



Val Gly Ala Thr Arg He Gin He Ala Leu Lys Tyr Asp Glu Lys Asn 
660 665 670 



Lys Gin Phe Ala He Leu He He Gin Leu Ser Asn Leu Ser Ala Leu 
675 680 685 



Leu Gin Gin Gin Asp Gin Lys Val Asn He Arg Val Ala Val Leu Pro 
690 695 700 
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Cys Ser Glu Ser Thr Thr Cys Leu Phe Arg Thr Arg Pro Leu Asp Ala 
705 710 • 715 720 



Ser Asp Thr Leu Val Phe Asn Glu Val Phe Trp Val Ser Met Ser Tyr 
725 .730 735 



Pro Ala Leu His Gin Lys Thr Leu Arg Val Asp Val Cys Thr Thr Asp 
740 745 750 



Arg Ser His Leu Glu Glu Cys Leu Gly Gly Ala Gin lie Ser Leu Ala 
755 760 765 



Glu Val Cys Arg Ser Gly Glu Arg Ser Thr Arg Trp Tyr Asn Leu Leu 
770 775 780 



Ser Tyr Lys Tyr Leu Lys Lys Gin Ser Arg Glu Leii Lys Pro Val Gly 
785 790 795 800 



Val Met Ala Pro Ala Ser Gly Pro Ala Ser Thr Asp Ala Val Ser Ala 
805 810 815 



Leu Leu Glu Gin Thr Ala Val Glu Leu Glu Lys Arg Gin Glu Gly Arg 
820 825 830 



Ser Ser Thr Gin Thr Leu Glu Asp Ser Trp Arg Tyr Glu Glu Thr Ser 
835 840 ' 845 



Glu Asn Glu Ala Val Ala Glu Glu Glu Glu Glu Glu Val Glu Glu Glu 
850 855 860 



Glu Gly Glu Glu Asp Val Phe Thr Glu Lys Ala Ser Pro Asp Met Asp 
865 870 875 880 



Gly Tyr Pro Ala Leu Lys Val Asp Lys Glu Thr Asn Thr Glu Thr Pro 
885 890 895 



Ala Pro Ser Pro Thr Val Val Arg Pro Lys Asp Arg Arg Val Gly Thr 
900 905 910 



Pro Ser Gin Gly Pro Phe Leu Arg Gly Ser Thr He He Arg Ser Lys 
915 920 925 



Thr Phe Ser Pro Gly Pro Gin Ser Gin Tyr Val Cys -Arg Leu Asn Arg 
930 935 940 



Ser Asp Ser Asp Ser Ser Thr Leu Ser Lys Lys Pro Pro Phe Val Arg 
945 950 955 960 



Asn Ser Leu Glu Arg Arg Ser Val Arg Met Lys Arg Pro Ser Ser Val 
965 ~ 970 975 
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Lys Ser Leu Arg Ser Glu Arg Leu He Arg Thr Ser Leu Asp Leu Glu 
980 985 990 



Leu Asp Leu Gin Ala Thr Arg Thr Trp- His Ser Gin Leu Thr Gin Glu 
995 1000 1005 



.He Ser Val Leu Lys Glu Leu Lys Glu Gin Leu Glu Gin Ala Lys 
1010 1015 1020 



Ser His Gly Glu Lys Glu Leu Pro Gin Trp Leu Arg Glu Asp Glu 
1025 1030 1035 



Arg Phe Arg Leu Leu Leu Arg Met Leu Glu Lys Arg Gin Met Asp 
1040 1045 1050 



Arg Ala Glu His Lys Gly Glu Leu Gin Thr Asp Lys Met Met Arg 
1055 1060 • 1065 



Ala Ala Ala Lys Asp Val His Arg Leu Arg Gly Gin Ser Cys Lys 
1070 1075 " 1080 



Glu Pro Pro Glu Val Gin Ser Phe Arg Glu Lys Met Ala Phe Phe 
1085 1090 1095 



Thr Arg Pro Arg Met Asn He Pro Ala Leu Ser Ala Asp Asp Val 
1100 1105 1110 



<210> 15 

<2ll> 1294 

<212> DNA 

<213> NM_001306 CLDN3, claudin 3 

<220> 

<221> CDS 

<222> (198) (857) 

<223> 



<400> 15 

caaagccaca ggcaggtgca ggcgcagccg cggcgagagc gtatggagcc gagccgttag 60 

cgcgcgccgt cggtgagtca gtccgtccgt ccgtccgtcc gtcggggcgc cgcagctccc 120 

gccaggccca gcggccccgg cccctcgtct ccccgcaccc ggagccaccc ggtggagcgg 180 

gccttgccgc ggcagcc atg tec atg "ggc ctg gag ate acg ggc ace gcg 230 
Met Ser Met Gly Leu Glu He Thr Gly Thr Ala 



1 
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1 5 ■ 10 

ctg gcc gtg ctg ggc tgg ctg ggc acc ate gtg tgc tgc gcg ttg ccc 278 
Leu Ala Val Leu Gly Trp Leu Gly Thr He Val Cys Cys Ala Leu Pro 
15 20 25 

atg tgg cgc gtg teg gcc ttc ate ggc age aac ate ate acg teg cag 326 
Met Trp Arg Val Ser Ala Phe He Gly Ser Asn He He Thr Ser Gin 
30 35 40 

aac ate tgg gag ggc ctg tgg atg aac tgc gtg gtg cag age acc ggc 374 
.Asn He Trp Glu Gly Leu Trp Met Asn Cys Val Val Gin Ser Thr Gly 
45 50 55 

cag atg cag tgc aag gtg tac gac teg ctg ctg gca ctg cca cag gac 422 
Gin Met Gin Cys Lys Val Tyr Asp Ser Leu Leu Ala Leu Pro Gin Asp 
60 65 70 75 

ctt cag gcg gcc cgc gcc etc ate gtg gtg gcc ate ctg ctg gcc gcc 470 
Leu Gin Ala Ala Arg Ala Leu He Val Val Ala He Leu Leu Ala Ala 
80 85 90 

ttc ggg ctg eta gtg gcg ctg gtg ggc gcc cag tgc acc aac tgc gtg 518 
Phe Gly Leu Leu Val Ala Leu Val Gly Ala Gin Cys Thr Asn Cys Val 
95 100 105 

cag gac gac acg gcc aag gcc aag ate acc ate gtg gca ggc gtg ctg 566 
Gin Asp Asp Thr Ala Lys Ala Lys He Thr He Val Ala Gly Val Leu 
HO lis 120 

ttc ctt etc gcc gcc ctg etc acc etc gtg ccg gtg tec tgg teg gcc 614 
Phe Leu Leu Ala Ala Leu Leu Thr Leu Val Pro Val Ser Trp Ser Ala 
125 130 135 

aac acc att ate egg gac ttc tac aac- ccc gtg gtg ccc gag gcg cag 662 
Asn Thr He He Arg Asp Phe Tyr Asn Pro Val Val Pro Glu Ala Gin 
140 145 150 155 

aag cgc gag atg ggc gcg ggc ctg tac gtg ggc tgg gcg gcc gcg gcg 710 
Lys Arg Glu Met Gly Ala Gly Leu Tyr Val Gly Trp Ala Ala Ala Ala 
160 165 170 

ctg cag ctg ctg ggg ggc gcg ctg etc tgc tgc teg tgt ccc cca cgc 758 
Leu Gin Leu Leu Gly Gly Ala Leu Leu Cys Cys Ser Cys Pro Pro Arg 
175 180 185 

gag aag aag tac acg gcc acc aag gtc gtc tac tec gcg ccg cgc tec 806 
Glu Lys Lys Tyr Thr Ala Thr Lys Val Val Tyr Ser Ala Pro Arg Ser 
190 195 200 

acc ggc ccg gga gcc age ctg ggc aca ggc tac gac cgc aag gac tac 854 
Thr Gly Pro Gly Ala Ser Leu Gly Thr Gly Tyr Asp Arg Lys Asp Tyr 
205 210 215 

gtc taagggacag aegcagggag accccaccac caccaccacc accaacacca 907 

Val 

220 



ccaccaccac cgcgagctgg agcgcgcacc aggccatcca gcgtgcagcc ttgcctcgga 967 

ggccagccca cccccagaag ccaggaagcc cccgcgctgg actggggcag cttccccagc 1027 

agccacggct ttgcgggccg ggcagtcgac ttcggggccc agggaccaac ctgcatggac 1087 

tgtgaaacct cacccttctg gagcaegggg cctgggtgac cgccaatact tgaccacccc 1147 

gtcgagcccc atcgggccgc tgcccccatg ctcgcgctgg gcagggaccg gcagccctgg 1207 
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aaggggcact tgatattttt caataaaagc ctttcgtttt gcaaaaaaaa aaaaaaaaaa 1267 
aaaaaaaaaa aaaaaaaaaa aaaaaaa 12 g 4 



<210> 


16 


<211> 


220 


<212> 


PRT 


<213> 


NM_C 


<400> 


16 



Met Ser Met Gly Leu Glu He Thr Gly Thr Ala Leu Ala Val Leu Gly 
15 10 15 



Trp Leu Gly Thr He Val Cys Cys Ala Leu Pro Met Trp Arg Val Ser 
20 25 30 



Ala Phe He Gly Ser Asn He He Thr Ser Gin Asn He Trp Glu Gly 
35 40 45 



Leu Trp Met Asn Cys Val Val Gin Ser Thr Gly Gin Met Gin Cys Lys 
50 55 60 



Val Tyr Asp Ser Leu Leu Ala Leu Pro Gin Asp Leu Gin Ala Ala Arg 
65 70 75 80 

Ala Leu He Val Val Ala He Leu Leu Ala Ala Phe Gly Leu Leu Val 
85 90 95 

Ala Leu Val Gly Ala Gin Cys Thr Asn Cys Val Gin Asp Asp Thr Ala 
100 105 no 



Lys Ala Lys He Thr lie Val Ala Gly Val Leu Phe Leu Leu Ala Ala 
115 120 125 



Leu Leu Thr Leu Val Pro Val Ser Trp Ser Ala Asn Thr He He Arg 
130 135 140 



Asp Phe Tyr Asn Pro Val Val Pro Glu Ala Gin Lys Arg Glu Met Gly 
i45 150 155 " 160 

Ala Gly Leu Tyr Val Gly Trp Ala Ala Ala Ala Leu Gin Leu Leu Gly 
165 170 175 



Gly Ala Leu Leu Cys Cys Ser Cys Pro Pro Arg Glu Lys Lys Tyr Thr 
180 185. 190 



Ala Thr Lys Val Val Tyr Ser Ala Pro Arg Ser Thr Gly Pro Gly Ala 
195 200 205 
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Ser Leu Gly Thr Gly Tyr Asp Arg Lys Asp Tyr Val 
210 " 215 220 

<210> 17 

<211> 1853 

<212> DNA 

<213> NM 022454 SOX17 



<220> 

<221> CDS 

<222> (205) . . (1446) 

<223> 



<400> 17 

gcagtgtcac taggccggct gggggccctg ggtacgctgt agaccagacc gcgacaggcc 60 

agaacacggg cggcggcttc gggccgggag acccgcgcag ccctcggggc atctcagtgc 120 

ctcactcccc accccctccc ccgggtcggg ggaggcggcg cgtccggcgg agggttgagg 180 

ggagcggggc aggcctggag cgcc atg age age ccg gat gcg gga tac gec 231 

Met Ser Ser Pro Asp Ala Gly Tyr Ala 
1 5 

agt gac gac cag age cag acc cag age gcg ctg ccc gcg gtg atg gec 279 
Ser Asp Asp Gin Ser Gin Thr Gin Ser Ala Leu Pro Ala Val Met Ala 
10 15 20 25 

ggg ctg ggc ccc tgc ccc tgg gec gag teg ctg age ccc ate ggg gac 327 
Gly Leu Gly Pro Cys Pro Trp Ala Glu Ser Leu Ser Pro lie Gly Asp 
30 35 40 

atg aag gtg aag ggc gag gcg ccg gcg aac age gga gca ccg gec ggg 375 
Met Lys Val Lys Gly Glu Ala Pro Ala Asn Ser Gly Ala Pro Ala Gly 
45 50 ' - 55 

gee gcg ggc cga gec aag ggc gag tec cgt ate egg egg ccg atg aac 423 
Ala Ala Gly Arg Ala Lys Gly Glu Ser Arg lie Arg Arg Pro Met Asn 
60 65 70 

get ttc atg gtg tgg get aag gac gag cgc aag egg ctg gcg cag cag 471 

Ala Phe Met Val Trp Ala Lys Asp Glu Arg Lys Arg Leu Ala Gin Gin 
75 80 85 

aat cca gac ctg cac aac gec gag ttg age aag atg ctg ggc aag teg 519 
Asn Pro Asp Leu His Asn Ala Glu Leu Ser Lys Met Leu Gly Lys Ser 
90 95 100 105 

tgg aag gcg ctg acg ctg gcg gag aag egg ccc ttc gtg gag gag gca 567 
Trp Lys Ala Leu Thr Leu Ala Glu Lys Arg Pro Phe Val Glu Glu Ala 
110 115 120 

gag egg ctg cgc gtg cag cac atg cag gac cac ccc aac tac aag tac 615 
Glu Arg Leu Arg Val Gin His Met Gin Asp His Pro Asn Tyr Lys Tyr 
125 130 135 
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egg ccg egg egg cgc aag cag gtg aag egg ctg aag egg gtg gag ggc 663 
Arg Pro Arg Arg Arg Lys Gin Val Lys Arg Leu Lys Arg Val Glu Gly 
140 145 ; 150 

ggc ttc ctg cac ggc ctg get gag ccg cag gcg gee gcg ctg ggc ccc 711 
Gly Phe Leu His Gly Leu Ala Glu Pro Gin Ala Ala Ala Leu Gly Pro 
155 160 165. 

gag ggc ggc cgc gtg gec atg gac ggc ctg ggc etc cag ttc ccc gag 759 
Glu Gly Gly Arg Val Ala Met Asp Gly Leu Gly Leu Gin Phe Pro Glu 
170 175 180 185 

cag ggc ttc ccc gee ggc ccg ccg ctg ctg cct ccg cac atg ggc ggc 807 
Gin Gly Phe Pro Ala Gly Pro Pro Leu Leu Pro Pro His Met Gly Gly 
190 195 200 

cac tac cgc gac tgc cag agt ctg ggc gcg cct ccg etc gac ggc tac 

His Tyr Arg Asp Cys Gin Ser Leu Gly Ala Pro Pro Leu Asp Gly Tvr 
•205 . 210 215 

ccg ttg ccc acg ccc "gac acg tec ccg ctg gac ggc gtg gac ccc gac 
Pro Leu Pro Thr Pro Asp Thr Ser Pro Leu Asp Gly Val Asp Pro Asd 
220 225 230 

l° 9 lt C ^° 9CC 9CC ° Cg atg CCC ggg gac tgc cc 9 9cg gec ggc 951 
Pro Ala Phe Phe Ala Ala Pro Met Pro Gly Asp Cys Pro Ala Ala Glv 
235 240 245 

ace tac age tac gcg cag gtc teg gac tac get ggc ccc ccg gag cct 
Thr Tyr Ser Tyr Ala Gin Val Ser Asp Tyr Ala Gly Pro Pro Glu Pro 
250 255 260 265 

ccc gee ggt ccc atg cac ccc cga etc ggc cca gag ccc gcg ggt ccc 
Pro Ala Gly Pro Met His Pro Arg Leu Gly Pro Glu Pro Ala Gly Pro 
27 0 275 280 



855 



903 



999 



1047 



teg att ccg ggc etc ctg gcg cca ccc age gec ctt cac gtg tac tac 1095 
Ser He Pro Gly Leu Leu Ala Pro Pro : Ser Ala Leu His Val Tyr Tyr 
285 290 295 

ggc gcg atg ggc teg ccc ggg gcg ggc ggc ggg cgc ggc ttc cag atg 1143 
Gly Ala Met Gly Ser Pro Gly Ala Gly Gly Gly Arg Gly Phe Gin Met 
300 305 310 

cag ccg caa cac cag cac cag cac cag cac cag cac cac ccc ccg ggc 1191 
Gin Pro Gin His Gin His Gin His Gin His Gin His His Pro Pro Glv 
315 320 325 

ccc gga cag ccg teg ccc cct ccg gag gca ctg ccc tgc egg gac ggc 1239 

Pro Gly Gin Pro Ser Pro Pro Pro Glu Ala Leu Pro Cys Arg Asp Glv 
330 335 340 345 

acg gac ccc agt cag ccc gee gag etc etc ggg gag gtg gac cgc acg 1287 
Thr Asp Pro Ser Gin Pro Ala Glu Leu Leu Gly Glu Val Asp Arg Thr 
350 355 360 

gaa ttt gaa cag tat ctg cac ttc gtg tgc aag cct gag atg ggc etc 1335 
Glu Phe Glu Gin Tyr Leu His Phe Val Cys Lys Pro Glu Met Gly Leu 
365 370 375 

ccc tac cag ggg cat gac tec ggt gtg aat etc ccc gac age cac ggg 1383 
Pro Tyr Gin Gly His Asp Ser Gly Val Asn Leu Pro Asp Ser His Gly 
380 385 390 

gec att tec teg gtg gtg tec gac gee age tec gcg gta tat tac tgc 1431 
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Ala lie Ser Ser Val Val Ser Asp Ala Ser Ser Ala Val Tyr Tyr Cys 
395 400 405 

aac tat cct gac gtg tgacaggtcc ctgatccgc'c ccagcctgca ggccagaagc 1486 

Asn Tyr Pro Asp Val 

410 



agtgttacac 


acttcctgga 


ggagctaagg 


aaatcctcag actcctgggt 


ttttgttgtt 


1546 


gctgttgttg 


ttttttaaaa 


ggtgtgttgg 


catataattt atggtaattt 


attttgtctg 


1606 


ccacttgaac 


agtttggggg 


ggtgaggttt 


catttaaaat ttgttcagag 


atttgtttcc 


1666 


catagttgga 


ttgtcaaaac 


cctatttcca 


agttcaagtt aactagcttt 


gaatgtgtcc 


1726 


caaaacagct 


tcctccattt 


cctgaaagtt 


tattgatcaa agaaatgttg 


tcctgggtgt 


1786 


gttttttcaa 


tcttctaaaa 


aataaaatct 


ggaatcctga aaaa'aaaaaa 


aaaaaaaaaa 


1846 


aaaaaaa 










1853 



<210> 18 
<211> 414 ■ 
<212> PRT 

<213> NM_022454 SOX17 
<400> 18 

Met Ser Ser Pro Asp Ala Gly Tyr Ala Ser Asp Asp Gin Ser Gin Thr 
15 10 15 

Gin Ser Ala Leu Pro Ala Val Met Ala Gly Leu Gly Pro Cys Pro Trp 
20 25 30 



Ala Glu Ser Leu Ser Pro lie Gly Asp Met Lys Val Lys Gly Glu Ala 
35 40 45 



Pro Ala Asn Ser Gly Ala Pro Ala Gly Ala Ala Gly Arg Ala Lys Gly 
50 55 60 



Glu Ser Arg lie Arg Arg Pro Met Asn Ala Phe Met Val Trp Ala Lys 
65 70 . 75 80 



Asp Glu Arg Lys Arg Leu Ala Gin Gin Asn Pro Asp Leu His Asn Ala 
85 90 95 



Glu Leu Ser Lys Met Leu Gly Lys Ser Trp Lys Ala Leu Thr Leu Ala 
100 105 HO 



Glu Lys Arg Pro Phe Val Glu Glu Ala Glu Arg Leu Arg Val Gin His 
115 120 125 



Met Gin Asp His Pro Asn Tyr Lys Tyr Arg Pro Arg Arg Arg Lys Gin 
130 135 140 
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Val Lys Arg Leu Lys Arg Val Glu Gly Gly Phe Leu His Gly Leu Ala 
145 150 155 160 



Glu Pro Gin Ala Ala Ala Leu Gly Pro Glu Gly Gly Arg Val Ala Met 
165 ' 170 175 



Asp Gly Leu Gly Leu Gin Phe Pro Glu Gin Gly Phe Pro Ala Gly Pro 
180 185 190 



Pro Leu Leu Pro Pro His Met Gly Gly His Tyr Arg Asp Cys Gin Ser 
195 200 205 



Leu Gly Ala Pro Pro Leu Asp Gly Tyr Pro Leu Pro Thr Pro Asp Thr 
210 215 220 



Ser Pro Leu Asp Gly Val Asp Pro Asp Pro Ala Phe Phe Ala Ala Pro 
225 230 235 240 



Met Pro Gly Asp Cys Pro Ala Ala Gly Thr Tyr Ser Tyr Ala Gin Val 
245 250 255 



Ser Asp Tyr Ala Gly Pro Pro Glu Pro Pro Ala Gly Pro Met His Pro 
260 265 - 270 



Arg Leu Gly Pro Glu Pro Ala Gly " Pro Ser He Pro Gly Leu Leu Ala 
275 280 285 



Pro Pro Ser Ala Leu His Val Tyr Tyr Gly Ala Met Gly Ser Pro Gly 
290 295 300 



Ala Gly Gly Gly Arg Gly Phe Gin Met Gin Pro Gin His Gin His Gin 
305 310 315 320 



His Gin His Gin His His Pro Pro Gly Pro Gly Gin Pro Ser Pro Pro 
325 330 335 



Pro Glu Ala Leu Pro Cys Arg Asp Gly Thr Asp Pro Ser Gin Pro Ala 
340 345 350 



Glu Leu Leu Gly Glu Val Asp .Arg Thr Glu Phe Glu Gin Tyr Leu His 
355 360 365 



Phe Val Cys Lys Pro Glu Met Gly Leu Pro Tyr Gin Gly His Asp Ser 
370 375 380 



Gly Val Asn Leu Pro Asp Ser His Gly Ala He Ser Ser Val Val Ser 
385 390. 395 400 



Asp Ala Ser Ser Ala Val Tyr Tyr Cys Asn Tyr Pro Asp Val 
405 410 
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<210> 19 

<211> 3702 

<212> DNA 

<213> NMJ)05682 GPRS 6 

<220> 

<221> CDS 

<222> (159).. (2237) 
<223> 



<400> 19 

tggagggtct cgctctgtca cacaggctgg agtgcagtgg tgtgatcttg gctcatcgta . 60 

acctccacct cccgggttca agtgattctc atgcctcagc ctcccgagta gctgggatta 120 

caggtggtga cttccaagag tgactccgtc ggaggaaa atg act ccc cag teg ctg 176 

Met Thr Pro Gin Ser Leu 
1 5 

ctg cag acg aca ctg ttc ctg ctg agt ctg etc ttc ctg gtc caa ggt 224 
Leu Gin Thr Thr Leu Phe Leu Leu Ser Leu Leu Phe Leu Val Gin Gly 
10 15 - 20 

gee cac ggc agg ggc cac agg gaa gac ttt cgc ttc tgc age cag egg 272 
Ala His Gly Arg Gly His Arg Glu Asp Phe Arg Phe Cys Ser Gin Arg 
25 30 35 

aac cag aca cac agg age age etc cac tac aaa ccc aca cca gac ctg 320 
Asn Gin Thr His Arg Ser Ser Leu His Tyr Lys Pro Thr Pro Asp Leu 
40 45 50 

cgc ate tec ate gag aac tec gaa gag gee etc aca gtc cat gee cct 368 
Arg lie Ser He Glu Asn Ser Glu Glu Ala Leu Thr Val His Ala Pro 
55 60 65 70 

ttc cct gca gee cac cct get tec cga tec ttc cct gac ccc agg ggc 416 
Phe Pro Ala Ala His Pro Ala Ser Arg Ser Phe Pro Asp Pro Arq Gly 
75 80 85 

etc tac cac ttc tgc etc tac tgg aac cga cat get ggg aga tta cat 464 
Leu Tyr His Phe Cys Leu Tyr Trp Asn Arg His Ala Gly Arg Leu His 
90 95 100 

ctt etc tat ggc aag cgt gac ttc ttg ctg agt gac aaa gee tct age 512 
Leu Leu Tyr Gly Lys Arg Asp Phe Leu Leu Ser. Asp Lys Ala Ser Ser 
105 no us 

etc etc tgc ttc cag cac cag gag gag age ctg get cag ggc ccc ccg 560 
Leu Leu Cys Phe Gin His Gin Glu Glu Ser Leu Ala Gin Gly Pro Pro 
120 . 125 130. 

ctg tta gee act tct gtc ace tec tgg tgg age cct cag aac ate age 608 
Leu Leu Ala Thr Ser Val Thr Ser Trp Trp Ser Pro Gin Asn He Ser 
135 140 145 150 



ctg ccc agt gee gee age ttc ace ttc tec ttc cac agt cct ccc cac 



656 
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Leu Pro Ser Ala Ala Ser Phe Thr Phe. Ser Phe His Ser Pro Pro His 
155 160 165 

lu 9 IT C *° aat gcc tcg gtg gac at 9 gag etc aaa agg gac 704 
Thr Ala Ala Has Asn Ala Ser Val Asp Met Cys Glu Leu Lys Arg Asp 
170 175 " 180 

etc cag ctg etc age cag ttc ctg aag cat ccc cag aag gcc tea agg 752 
Leu Gin Leu Leu Ser Gin Phe Leu Lys His Pro Gin Lys Ala Ser Arg 
185 190 195 

agg ccc tcg get gcc ccc gcc age cag cag ttg cag age ctg gag tcg 800 
Arg Pro Ser Ala Ala Pro Ala Ser Gin Gin lieu Gin Ser Leu Glu Ser 
200 205 210 

aaa ctg acc tct gtg aga ttc atg ggg gac atg gtg tec ttc gag gag 848 
Lys Leu Thr Ser Val Arg Phe Met Gly Asp Met Val Ser Phe Glu Glu 
215 2 20 225 230 

gac egg ate aac gcc acg gtg tgg aag etc cag ccc aca gcc ggc etc 896 
Asp Arg He Asn Ala Thr Val Trp Lys Leu Gin Pro Thr Ala Gly Leu 
235 240 245 

cag gac ctg cac ate cac tec egg cag gag gag gag cag age gag ate 944 
Gin Asp Leu Has He His Ser Arg Gin Glu Glu Glu Gin Ser Glu lie 
250 255 260 

atg gag tac tcg gtg ctg ctg cct cga aca etc ttc cag agg acg aaa 
Met Glu Tyr Ser Val Leu Leu Pro Arg Thr Leu Phe Gin Arg Thr Lys 
265 270 .275 

ggc egg age ggg gag get gag aag aga etc etc ctg gtg gac ttc age 
Gly Arg Ser Gly Glu Ala Glu Lys Arg Leu Leu Leu Val Asp Phe Ser 
2 «0 285 290 

age caa gcc ctg ttc cag gac aag aat tec age cac gtc ctg ggt gag 
Ser Gin Ala Leu Phe Gin Asp Lys Asn Ser Ser His Val Leu Gly Glu 
295 300 305 . 310 

aag gtc ttg ggg att gtg gta cag aac acc aaa gta gcc. aac etc acg 1136 
Lys Val Leu Gly He Val Val Gin Asn Thr Lys Val Ala Asn Leu Thr 



992 



1040 



1088 



315 320 



325 



gag ccc gtg gtg etc acc ttc cag cac cag eta cag ccg aag aat gtg 1184 
Glu Pro Val Val Leu Thr Phe Gin His Gin Leu Gin Pro Lyi Asn Vat 
330 335 



340 



1280 



1328 



act ctg caa tgt gtg ttc tgg gtt gaa gac ccc aca ttg age age ccg 1232 
Thr Leu Gin Cys Val Phe Trp Val Glu Asp Pro Thr Leu Ser Ser Pro 
345 350 355 

ggg cat tgg age agt get ggg tgt gag acc gtc agg aga gaa acc caa 
Gly His Trp Ser Ser Ala Gly Cys Glu Thr Val Arg Arg Glu Thr Gin 
360 365 370 

aca tec tgc ttc tgc aac cac ttg acc tac ttt gca gtg ctg atg gtc 
Thr Ser Cys Phe Cys Asn His Leu Thr Tyr Phe Ala Val Leu Met Val 
375 380 385 390 

tec tcg gtg gag gtg gac gcc gtg cac aag cac tac ctg age etc etc 1376 
Ser Ser Val Glu Val Asp Ala Val His Lys His Tyr Leu Ser Leu Leu 
395 400 405 

tec tac gtg ggc tgt gtc gtc tct gcc ctg gcc tgc ctt gtc acc att 1424 
Ser Tyr Val Gly Cys Val Val Ser Ala Leu Ala Cys Leu Val Thr He 
410 415 * " 420 

gcc gcc tac etc tgc tec. agg gtg ccc ctg ccg tgc agg agg aaa cct 1472 
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Ala Ala Tyr Leu Cys Ser Arg Val Pro Leu Pro Cys Arg Arg Lys Pro 
425 430 435 

egg gac tac acc ate aag gtg cac atg aac ctg ctg ctg gee gtc ttc 1520 
Arg Asp Tyr Thr lie Lys Val His Met Asn Leu Leu Leu Ala Val Phe 
440 445 450 

ctg ctg gac acg age ttc ctg etc age gag ccg gtg gee ctg aca ggc 1568 
Leu Leu Asp Thr Ser Phe Leu Leu Ser Glu Pro Val Ala Leu Thr Gly 
455 460 465 470 

tct gag get ggc tgc cga gec agt gee ate ttc ctg cac ttc tec ctg 1616 
Ser Glu Ala Gly Cys Arg Ala Ser Ala lie Phe Leu His Phe Ser Leu 
475 480 485 

etc acc tgc ctt tec tgg atg ggc etc gag ggg tac aac etc tac cga 1664 
Leu Thr Cys Leu Ser Trp Met Gly Leu Glu Gly Tyr Asn Leu Tyr Arg 
490 495 " * 500 

etc gtg gtg gag gtc ttt ggc acc tat gtc cct ggc tac eta etc aag 1712 
Leu Val Val Glu Val Phe Gly Thr Tyr Val Pro Gly Tyr Leu Leu Lys 
505 510 515 

ctg age gee atg ggc tgg ggc ttc ccc ate ttt ctg gtg acg ctg gtg 1760 
Leu Ser Ala Met Gly Trp Gly Phe Pro lie Phe Leu Val Thr Leu Val 
520 525 530 



gec ctg gtg gat gtg gac aac tat ggc ccc ate ate ttg get gtg cat 
Ala Leu Val Asp Val Asp Asn Tyr Gly Pro lie lie Leu Ala Val His 
535 540 545 550 

agg act cca gag ggc gtc ate tac cct tec atg tgc tgg ate egg gac 
Arg Thr Pro Glu Gly Val lie Tyr Pro Ser Met Cys Trp He Arg Asp 
555 560 565 

tec ctg gtc age tac ate acc aac ctg ggc etc ttc age ctg gtg ttt 
Ser Leu Val Ser Tyr He Thr Asn Leu Gly Leu Phe Ser Leu Val Phe 
570 575 580 

ctg ttc aac atg gee atg eta gec acc -atg gtg gtg cag ate ctg egg 
Leu Phe Asn Met Ala Met Leu Ala Thr Met Val Val Gin He Leu Arg 
585 -590 595 



1808 



1856 



1904 



1952 



ctg cgc ccc cac acc caa aag tgg tea cat gtg ctg aca ctg ctg ggc 2000 
Leu Arg Pro His Thr Gin Lys Trp Ser His Val Leu Thr Leu Leu Gly 
600 ... 605 610 

etc age ctg gtc ctt ggc ctg ccc tgg gee ttg ate ttc ttc tec ttt 2048 
Leu Ser Leu Val Leu Gly Leu Pro Trp Ala Leu He Phe Phe Ser Phe 
615 620 625 630 

get tct ggc acc ttc cag ctt gtc gtc etc tac ctt ttc age ate ate 2096 
Ala Ser Gly Thr Phe Gin Leu Val Val Leu Tyr Leu Phe Ser He He 
635 640 645 

acc tec ttc caa ggc ttc etc ate ttc ate tgg tac tgg tec atg egg 2144 
Thr Ser Phe Gin Gly Phe Leu He Phe He Trp Tyr Trp Ser Met Arg 
650 655 660 

ctg cag gee egg ggt ggc ccc tec cct ctg aag age aac tea gac age 2192 
Leu Gin Ala Arg Gly Gly Pro Ser Pro Leu Lys Ser Asn Ser Asp Ser 
665 670 675 

gec agg etc ccc ate age teg ggc age acc teg tec age cgc ate 2237 
Ala Arg Leu Pro He Ser Ser Gly Ser Thr Ser Ser Ser Arg He 
680 685 690 

taggcctcca gcccacctgc ccatgtgatg aagcagagat gcggcctcgt cgcacactgc 2297 
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ctgtggcccc 


cgagccaggc ccagccccag gccagtcagc cgcagacttt ggaaagccca 


2357 


acgaccatgg 


agagatgggc cgttgccatg gtggacggac tcccgggctg ggcttttgaa 


2417 


ttggccttgg 


ggactactcg gctctcactc agctcccacg ggactcagaa gtgcgccgcc 


2477 


atgctgccta 


gggtactgtc cccacatctg tcccaaccca gctggaggcc tggtctctcc 


2537 


ttacaaaccc 


tgggcccagc cctcattgct gggggccagg ccttggatct tgagggtctg 


2597 


gcacatcctt 


aatcctgtgc ccctgcctgg gacagaaatg tggctccagt tgctctgtct 


2657 


ctcgtggtca 


ccctgagggc actctgcatc ctctgtcatt ttaacctcag gtggcaccca 


2717 


gggcgaatgg 


ggcccagggc agaccttcag ggccagagcc ctggcggagg agaggccctt 


2777 


tgccaggagc 


acagcagcag ctcgcctacc tctgagccca ggccccctcc ctccctcagc 


2837 


cccccagtcc 


tccctccatc ttccctgggg ttctcctcct ctcccagggc ctccttgctc 


2897 


cttcgttcac 


agctgggggt ccccgattcc aatgctgttt tttggggagt ggtttccagg 


2957 


agctgcctgg 


tgtctgctgt aaatgtttgt ctactgcaca agcctcggcc tgcccctgag 


3017 


ccaggctcgg 


taccgatgcg tgggctgggc taggtccctc tgtccatctg ggcctttgta 


3077 


tgagctgcat 


tgcccttgct caccctgacc aagcacacgc ctcagagggg ccctcagcct 


3137 


ctcctgaagc 


ccfcctfcgtgg caagaactgt ggaccatgcc agtcccgtct ggtttccatc 


3197 


ccaccactcc 


aaggactgag actgacctcc tctggtgaca ctggcctaga gcctgacact 


3257 


ctcctaagag 


gttctctcca agcccccaaa tagctccagg cgccctcggc cgcccatcat 


3317 


ggttaattct 


gtccaacaaa cacacacggg tagattgctg gcctgttgta ggtggtaggg 


3377 


acacagatga 


ccgacctggt cactcctcct gccaacattc agtctggtat gtgaggcgtg 


3437 


cgtgaagcaa 


gaactcctgg agctacaggg acagggagcc atcattcctg cctgggaatc 


3497 


ctggaagact 


tcctgcagga gtcagcgttc aatcttgacc ttgaagatgg gaaggatgtt 


3557 


ctttttacgt 


accaattctt ttgtcttttg atattaaaaa gaagtacatg ttcattgtag 


3617 


agaatttgga 


aactgtagaa gagaatcaag aagaaaaata aaaatcagct gttgtaatcg 


3677 


cctagcaaaa 


aaaaaaaaaa aaaaa 


3702 



<210> 20 

<211> 693 

<212> PRT 

<213> NM 005682 GPRS 6 



<400> 20 

Met Thr Pro Gin Ser Leu Leu Gin Thr Thr Leu Phe Leu Leu Ser Leu 
15 10 15 



Leu Phe Leu Val Gin Gly Ala His Gly Arg Gly His Arg Glu Asp Phe 
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20 25 30 

Arg Phe Cys Ser Gin Arg Asn Gin Thr His Arg Ser Ser Leu His Tyr 
35 40 45 

Lys Pro Thr Pro Asp Leu Arg He Ser He Glu Asn Ser Glu Glu Ala 
50 55 60 

Leu Thr Val His Ala Pro Phe Pro Ala Ala His Pro Ala Ser Arg Ser 
65 70 75 80 

Phe Pro Asp Pro Arg Gly Leu Tyr His Phe Cys Leu Tyr Trp Asn Arg 
85 90 95 

His Ala Gly Arg Leu His Leu Leu Tyr Gly Lys Arg Asp Phe Leu Leu 
100 105 HO 

Ser Asp Lys Ala Ser Ser Leu Leu Cys Phe Gin His Gin Glu Glu Ser 
115 120 .125 

Leu Ala Gin Gly Pro Pro Leu Leu Ala Thr Ser Val Thr Ser Trp Trp 
130 135 140 

Ser Pro Gin Asn He Ser Leu Pro Ser. Ala Ala Ser Phe Thr Phe Ser 
145 150 155 160 

Phe His Ser Pro Pro His Thr Ala Ala His Asn Ala Ser Val Asp Met 
165 170 175 

Cys Glu Leu Lys Arg Asp Leu Gin Leu Leu Ser Gin Phe Leu Lys His 
180 185 190 

Pro Gin Lys Ala Ser Arg Arg Pro Ser Ala Ala Pro Ala Ser Gin Gin 
195 200 205 

Leu Gin Ser Leu Glu Ser Lys Leu Thr Ser Val Arg Phe Met Gly Asp 
210 215 220 

Met Val Ser Phe Glu Glu Asp Arg He Asn Ala Thr Val Trp Lys Leu 
225 230 235 240 

Gin Pro Thr Ala Gly Leu Gin Asp Leu His He His Ser Arg Gin Glu 
245 250 255 

Glu Glu Gin Ser Glu He Met Glu Tyr Ser Val Leu Leu Pro Arg Thr 
260 265 270 

Leu Phe Gin Arg Thr Lys Gly Arg Ser Gly Glu Ala Glu Lys Arg Leu 
275 280 285 



Leu Leu Val Asp Phe Ser Ser Gin Ala Leu Phe Gin Asp Lys Asn Ser 
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290 295 300 

Ser His Val Leu Gly Glu Lys Val Leu Gly He Val Val Gin Asn Thr 
305 310 315 320 

Lys Val Ala Asn Leu Thr Glu Pro Val Val Leu Thr Phe Gin His Gin 
325 330 335 

Leu Gin Pro Lys Asn Val Thr Leu Gin Cys Val Phe Trp Val Glu Asp 
340 345 350 

Pro Thr Leu Ser Ser Pro Gly His Trp Ser Ser Ala Gly Cys Glu Thr 
355 360 365 

Val Arg Arg Glu Thr Gin Thr Ser Cys Phe Cys Asn His Leu Thr Tyr 
370 375 380 

Phe Ala Val Leu Met Val Ser Ser Val Glu Val Asp Ala Val His Lys 
385 390 395 400 

His Tyr Leu Ser Leu Leu Ser Tyr Val Gly Cys Val Val Ser Ala Leu 
405 410 415 

Ala Cys Leu Val Thr He Ala Ala Tyr Leu Cys Ser Arg Val Pro Leu 
420 425 430 

Pro Cys Arg Arg Lys Pro Arg Asp Tyr Thr He Lys Val His Met Asn 
435 440 445 

Leu Leu Leu Ala Val Phe Leu Leu Asp Thr Ser Phe Leu Leu Ser Glu 
450 455 ,460 

Pro Val Ala Leu Thr Gly Ser Glu Ala Gly Cys Arg Ala Ser Ala He 
465 470 475 480 

Phe Leu His Phe Ser Leu Leu Thr Cys Leu Ser Trp Met Gly Leu Glu 
485 490 495 

Gly Tyr Asn Leu Tyr Arg Leu Val Val Glu Val Phe Gly Thr Tyr Val 
500 505 510 

Pro Gly Tyr Leu Leu Lys Leu Ser Ala Met Gly Trp Gly Phe Pro He 
515 520 525 



Phe Leu Val Thr Leu Val Ala Leu Val Asp Val Asp Asn Tyr Gly Pro 
530 535 540 



He He Leu Ala Val His Arg Thr Pro Glu Gly Val He Tyr Pro Ser 
545 550 555 ~ 560 
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Met Cys Trp He Arg Asp Ser Leu Val Ser Tyr He Thr Asn Leu Gly 
565 570 • 575 



Leu Phe Ser Leu Val Phe Leu Phe Asn Met Ala Met Leu Ala Thr Met 
580 585 590 



Val Val Gin He Leu Arg Leu Arg Pro His Thr Gin Lys Trp Ser His 
595 600 605 



Val Leu Thr Leu Leu Gly Leu Ser Leu Val Leu Gly Leu Pro Trp Ala 
610 615 620 



Leu He Phe Phe Ser Phe Ala Ser Gly Thr Phe Gin Leu Val Val Leu 
625 630 635 640 



Tyr Leu Phe Ser He He Thr Ser Phe Gin Gly Phe Leu He Phe He 
645 650 ~ 655 



Trp Tyr Trp Ser Met Arg Leu Gin Ala Arg Gly Gly Pro Ser Pro Leu 
660 665 670 



Lys Ser Asn Ser Asp Ser Ala Arg Leu Pro He Ser Ser Gly Ser Thr 
675 680 685 



Ser Ser Ser Arg He 
690 



<210> 21 

<211> 1332 

<212> DNA 

<213> NM_001307 claudin 7, CLDN7 

<220> 

<221> CDS 

<222> (427) . . (1059) 

<223> 



<400> 21 






ctgctggctc 


acctccgagc cacctctgct gcgcaccgca gcctcggacc tacagcccag 


60 


gatactttgg 


gacttgccgg cgctcagaaa cgcgcccaga cggcccctcc accttttgtt 


120 


tgcctagggt 


cgccgagagc gcccggaggg aaccgcctgg ccttcgggga ccaccaattt 


180 


tgtctggaac 


caccctcccg gcgtatccta ctccctgtgc cgcgaggcca tcgcttcact 


240 


ggaggggtcg 


atttgtgtgt agtttggtga caagatttgc attcacctgg cccaaaccct 


300 


ttttgtctct 


ttgggtgacc ggaaaactcc acctcaagtt ttcttttgtg gggctgcccc 


360 
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ccaagtgtcg tttgttttac tgtagggtct cccgcccggc gcccccagtg ttttctgagg 420 

gcggaa atg gcc aat teg ggc ctg cag ttg ctg ggc ttc tec atg gee 468 
Met Ala Asn Ser Gly Leu Gin Leu Leu Gly Phe Ser Met Ala 
15 io 

ctg ctg ggc tgg gtg ggt ctg gtg gcc tgc acc gcc ate ccg cag tgg 516 
Leu Leu Gly Trp Val Gly Leu Val Ala Cys Thr Ala He Pro Gin Trp 
15 20 25 30 

cag atg age tec tat gcg ggt gac aac ate ate acg gcc cag gcc atg 564 
Gin Met Ser Ser Tyr Ala Gly Asp Asn He He Thr Ala Gin Ala Met 
35 40 45 

tac aag ggg ctg tgg atg gac tgc gtc acg cag age acg ggg atg atg 612 
Tyr Lys Gly Leu Trp Met Asp Cys Val Thr Gin Ser Thr Gly Met Met 
50 55 60 

age tgc aaa atg tac gac teg gtg etc gcc ctg tec gcg gcc ttg cag 660 
Ser Cys Lys Met Tyr Asp Ser Val Leu Ala Leu Ser Ala Ala Leu Gin 
65 70 75 

gcc act cga gcc eta atg gtg gtc tec ctg gtg ctg ggc ttc ctg gcc 708 
Ala Thr Arg Ala Leu Met Val Val Ser Leu Val Leu Gly Phe Leu Ala 
80 85 90 

atg ttt gtg gcc acg atg ggc atg aag tgc acg cgc tgt ggg gga gac 756 
Met Phe Val Ala Thr Met Gly Met Lys Cys Thr Arg Cys Gly Gly Asp 
95 100 105 * no 

gac aaa gtg aag aag gcc cgt ata gcc atg ggt gga ggc ata att ttc 804 
Asp Lys Val Lys Lys. Ala Arg He Ala Met Gly Gly Gly He He Phe 
115 120 125 

ate gtg gca ggt ctt gcc acc ttg gta get tgc tec tgg tat ggc cat 852 
He Val Ala Gly Leu Ala Thr Leu Val Ala Cys Ser Trp Tyr Gly His 
130 135 140 

cag att gtc aca gac ttt tat aac cct ttg ate cct acc aac att aag 900 
Gin He Val Thr Asp Phe Tyr Asn Pro Leu He Pro Thr Asn He Lys 
145 150 155 

tat gag ttt ggc cct gcc ate ttt att ggc tgg gca ggg tct gcc eta 948 
Tyr Glu Phe Gly Pro Ala He Phe He Gly Trp Ala Gly Ser Ala Leu 
160 165 170 

gtc ate ctg gga ggt gca ctg etc tec tgt tec tgt cct ggg aat gag 996 
Val He Leu Gly Gly Ala Leu Leu Ser Cys Ser Cys Pro Gly Asn Glu 
175 180 185 " 190 

age aag get ggg tac cgt gca ccc cgc tct tac cct aag tec aac tct 1044 
Ser Lys Ala Gly Tyr Arg Ala Pro Arg Ser Tyr Pro Lys Ser Asn Ser 
195 200 205 

tec aag gag tat gtg tgacctggga tctccttgcc ccagcctgac aggctatggg 1099 
Ser Lys Glu Tyr Val 
210 

agtgtctaga tgcctgaaag ggcctggggc tgagctcagc ctgtgggcag ggtgccggac 1159 

aaaggcctcc tggtcactct gtccctgcac tccatgtata gtcctcttgg gttgggggtg 1219 

ggggggtgcc gttggtggga gagacaaaaa gagggagagt gtgctttttg tacagtaata 1279 

aaaaataagt attgggaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa 1332 
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<ZlU> 


22 


<211> 


211 


<212> 


PRT 


<213> 


NM_ 


<400> 


22 



Met Ala Asn Ser Gly Leu Gin Leu Leu Gly Phe Ser Met Ala Leu Leu 
1 5 10 15 



Gly Trp Val Gly Leu Val Ala Cys Thr Ala He Pro Gin Trp Gin Met 
20 25 • 30 



Ser Ser Tyr Ala Gly Asp Asn lie He Thr Ala Gin Ala Met Tyr Lys 
35 40 45 



Gly Leu Trp Met Asp Cys Val Thr Gin Ser Thr Gly Met Met Ser Cys 
50 55 60 



Lys Met Tyr Asp Ser Val Leu Ala Leu Ser- Ala Ala Leu Gin Ala Thr 
65 70 75 80 



Arg Ala Leu Met Val Val Ser Leu Val Leu Gly Phe Leu Ala Met Phe 
85 90 95 



Val Ala Thr Met Gly Met Lys Cys Thr Arg Cys Gly Gly Asp Asp Lys 
100 105 HO 



Val Lys Lys Ala Arg He Ala Met Gly Gly Gly He He Phe He Val 
115 120 125 



Ala Gly Leu Ala Thr ieu Val Ala Cys Ser Trp Tyr Gly His Gin He 
130 135 140 



Val Thr Asp Phe Tyr Asn Pro Leu He Pro Thr Asn He Lys Tyr Glu 
145 150 155 160 

Phe Gly Pro Ala He Phe He Gly Trp Ala Gly Ser Ala Leu Val He 
165 170 ' . 175 



Leu Gly Gly Ala Leu Leu Ser Cys Ser Cys Pro Gly Asn Glu Ser Lys 
180 185 * 190 



Ala Gly Tyr Arg Ala Pro Arg Ser Tyr Pro Lys Ser Asn Ser Ser Lys 
195 200 205 



Glu Tyr Val 
210 
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<210> 23 

<211> 888 

<212> DNA 

<213> NMJU4736 KIAA0101 gene product 

<220> 

<221> CDS 

<222> (75). .(407) 

<223> 



<400> 23 

ggggccttgt ccagtgaaac accctcggct gggaagtcag ttcgttctct cctctcctct 

cttcttgttt gaac atg gtg egg act aaa gca gac agt gtt cca ggc act 
Met Val Arg Thr Lys Ala Asp Ser Val Pro Gly Thr 
1 5 .10 

tac aga aaa gtg gtg get get cga gcc'ccc aga aag gtg ctt ggt tct 
Tyr Arg Lys Val Val Ala Ala Arg Ala Pro Arg Lys Leu Gly Ser 
15 20 25 

tec acc tct gee act aat teg aca tea gtt tea teg agg aaa get aaa 
Ser Thr Ser Ala Thr Asn Ser Thr Ser Val Ser Ser Ar| Lys £a Glu 

35 40 

aat aaa tat gca gga ggg aac ccc gtt tgc gtg cgc cca act ccc aaa 
Asn Lys Tyr Ala Gly Gly Asn Pro Val Cys Val Arg Pro ?£ Pro Lyl 
45 50 - 55 60 

r?n t 333 ^f 3 « ' 993 933 ttC ttfc a ^ tta tcc c <=t aaa gat tct 
Trp Gin Lys Gly lie Gly Glu Phe Phe Arg Leu Ser Pro Lys Lp Ser 

65 70 75 

gaa aaa gag aat cag att cct gaa gag gca gca aac aat aao n-= m = 
Glu Lys Glu Asn Gin He Pro Glu Glu Ila lly sir Ser lly £ lly 
80 85 go 

fit f? 3 f 39 f 33 9Ca t9t cct tt » caa cot gat cac aca aat gat 
Lys Ala Lys Arg Lys Ala Cys Pro Leu Gin Pro Asp His Thr Asn i£p 
95 100 a o5 

gaa aaa gaa tagaactttc tcattcatct ttgaataacg tctccttgtt 
Glu Lys Glu 
110 

taccctggta ttctagaatg taaatttaca taaatgtgtt tgttccaatt agctttgttg 
aacaggcatt taattaaaaa atttaggttt aaatttagat gttcaaaagt agttgtgaaa 
tttgagaatt tgtaagacta attatggtaa cttagcttag tattcaatat aatgcattgt 
ttggtttctt ttaccaaatt aagtgtctag ttcttgctaa aatcaagtca ttgcattgtg 
ttctaattac aagtatgttg tatttgagat ttgettagat tgttgtactg ctgccatttt 
tattggtgtt tgattattgg aatggtgcca tattgtcact ccttctactt gctttaaaaa 
gcagagttag atttttgeae attaaaaaaa ttcagtatta attaaacact aaaaaaaaaa 
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aaaaaaaaaa aaaaaaaaaa a 



888 



<210> 24 

<211> 111 

<212> PRT 

<213> NM_014736 KIAA0101 gene product 

<400> 24 

Met Val Arg Thr Lys Ala Asp Ser Val Pro Gly Thr Tyr Arg Lys Val 
1 5 10 15 

Val Ala Ala Arg Ala Pro Arg Lys Val Leu Gly Ser Ser Thr Ser Ala 
20 25 30 

Thr Asn Ser Thr Ser Val Ser Ser Arg Lys Ala Glu Asn Lys Tyr Ala 
35 40 45 



Gly Gly Asn Pro Val Cys Val Arg Pro Thr Pro Lys Trp Gin Lys Gly 
50 55 60 . 

He Gly Glu Phe Phe Arg Leu Ser Pro Lys Asp Ser Glu Lys Glu Asn 
65 70 75 80 

Gin He Pro Glu Glu Ala Gly Ser Ser Gly Leu Gly Lys Ala Lys Arg 
85 90 , 95 



Lys Ala Cys Pro Leu Gin Pro Asp His Thr Asn Asp Glu Lys Glu 
100 105 no 



<210> 25 

<211> 598 

<212> DNA 

<213> NM_003064 secretory leukocyte protease inhibitor, SLPI 



<220> 

<221> CDS 

<222> (23).. (418) 

<223> 



<400> 25 

cagagtcact cctgccttca cc atg aag tec age ggc etc ttc ccc ttc ctg 52 

Met Lys Ser Ser Gly Leu Phe Pro Phe Leu 
1 5 10 
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gtg ctg ctt gcc ctg gga act ctg gca cct tgg get gtg gaa ggc tct 100 
Val Leu Leu Ala Leu Gly Thr Leu Ala Pro Trp Ala Val Glu Gly Ser 
15 20 25 

gga aag tec ttc aaa get gga gtc tgt cct cct aag aaa tct gcc cag 148 
Gly Lys Ser Phe Lys Ala Gly Val Cys Pro Pro Lys Lys Ser Ala Gin 
30 35 40 

tgc ctt aga tac aag aaa cct gag tgc cag agt gac tgg cag tgt cca 196 
. Cys Leu Arg Tyr Lys Lys Pro Glu Cys Gin Ser Asp Trp Gin Cys Pro 
45 50 55 

ggg aag aag aga tgt tgt cct gac -act tgt ggc ate aaa tgc ctg gat 244 
Gly Lys Lys Arg Cys Cys Pro Asp Thr Cys Gly He Lys Cys Leu Asp 
60 65 70 

cct gtt gac ace cca aac cca aca agg agg aag cct ggg aag tgc cca 292 
Pro Val Asp Thr Pro Asn Pro Thr Arg Arg Lys Pro Gly Lys Cys Pro 
75 80 85 90 

gtg act tat ggc caa tgt ttg atg ctt aac ccc ccc aat ttc tgt gag 340 
Val Thr Tyr Gly Gin Cys Leu Met Leu Asn Pro Pro Asn Phe Cys Glu 
95 100 105 

atg gat ggc cag tgc aag cgt gac ttg aag tgt tgc atg ggc atg tgt 388 
Met Asp Gly Gin Cys Lys Arg Asp Leu Lys Cys Cys Met Gly Met Cys 
HO 115 120 

ggg aaa tec tgc gtt tec cct gtg aaa get tgattcctgc catatggagg 438 
Gly Lys Ser Cys Val Ser Pro Val Lys Ala 
125 130 

aggctctgga gtcctgctct gtgtggtcca ggtcctttcc accctgagac ttggctccac 498 
cactgatatc ctcctttggg gaaaggcttg gcacacagca ggctttcaag aagtgccagt 558 
tgatcaatga ataaataaac gagectattt ctctttgcac 598 

<210> 26 

<211> 132 

<212> PRT 

<213> NMJ>03064 secretory leukocyte protease inhibitor, SLPI 



<400> 26 

Met Lys Ser Ser Gly Leu Phe Pro Phe Leu Val Leu Leu Ala Leu Gly 
1 5 10 15 

Thx Leu Ala Pro Trp Ala Val Glu Gly Ser Gly Lys Ser Phe Lys Ala 
20 25 30 

Gly Val Cys Pro Pro Lys Lys Ser Ala Gin Cys Leu Arg Tyr Lys Lys 
35 40 45 



Pro Glu Cys Gin Ser Asp Trp Gin Cys Pro Gly Lys Lys Arg Cys Cys 
50 55 -60 



0 
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Pro Asp Thr Cys Gly lie Lys Cys Leu Asp Pro Val Asp Thr Pro Asn 



70 7S 



80 



Pro Thr Arg Arg Lys Pro Gly Lys Cys Pro Val Thr Tyr Gly Gin Cys 
85 go 95 

Leu Met Leu Asn Pro Pro Asn Phe Cys Glu Met Asp Gly Gin Cys Lys 
100 105 . 110 

Arg Asp Leu Lys Cys Cys Met Gly Met Cys Gly Lys Ser Cys Val Ser 
115 120 .125 

Pro Val Lys Ala 
130 



<210> 27 

<211> 3970 

<212> DNA 

<213> NM 013994 DDR1 



<220> 

<221> CDS 

<222> (337) . . (3093) 

<223> 



<400> 27 

ggcttaggaa gtattaactg atctctgccc tagttctcat gtgttaaata tggatagtaa 

tagtatctac cttatgaagt gactgtgaag ataaaattat ggattctgtt taagggttta 

ggccagtgto tggcacaggg gaagoattct aaaaatatag ctgatgctgt taaacaatga 

ctgttgttgt tgttttactg ttattatccc caaagoggcc cattctgtct gttgctgtca 240 

gctatgactc agtcccctga ttaacttacg caccacccat tttatcccct gcagagatgc 300 

tgcccccaoc cccttaggcc cgagggatca ggagct atg gga cca gag gcc ctg 354 

Met Gly Pro Glu Ala Leu 
1 5 

Ser tit ri« f 9 r tg T ° tg Ct ° ttg gt9 gCa agt gat get gac atg 
Ser Ser Leu Leu Leu Leu Leu Leu Val Ala Ser Gly Asp Ala Asp Met 

10 15 20 

?l 9 ?f a = at gat cct 9 CC aa 9 cgc tat gcc ctg ggc atg cag 
Lys Gly His Phe Asp Pro Ala Lys Cys Arg Tyr Ala Leu Gly Me? Gin 
25 30 35 



60 
120 
180 



402 



450 



gac egg ace ate cca gac agt gac ate tct get tec age tec tgg tea 498 
Asp Arg Thr He Pro Asp Ser Asp lie Ser Ala Ser Ser Ser Tr? Ser 
40 45 50 

gat tec act gcc gcc cgc cac age agg ttg gag age agt gac ggg gat 546 
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61 



90 95 



100 



cag gga egg cat gec ggg ggc ctg ggc aag gag ttc tec egg age tac 
Gin Gly Arg His Ala Gly Gly Leu Gly Lys Glu Phe Ser Arg Ser Tyr 
105 no 



115 



egg ctg cgt tac tec egg gat ggt cgc cgc tgg atg ggc tgg aag gac 
Arg Leu Arg Tyr Ser Arg Asp Gly Arg Arg Trp Met Gly Trp Lys Asp 

120 m . 125 



130 



245 



250 2 55 



260 
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Asp Ser Thr Ala Ala Arg His Ser Arg Leu Glu Ser Ser Asp Gly Asp 
bb 60 65 70 

?f C ^ 9 CC ° 9Ca " g tcg gtg ttt: ccc a ag 9^ gag gag tac 594 
Gly Ala Trp Cys Pro Ala Gly Ser Val Phe Pro Lys Glu Glu Glu Tyr 
75 80 85 

ttg cag gtg gat eta caa cga ctg cac ctg gtg get ctg gtg ggc acc 
Leu Gin Val Asp Leu Gin Arg Leu His Leu Val Ala Leu Val Gly Thr 



642 



690 



738 



786 



cgc tgg ggt cag gag gtg ate tea ggc aat gag gac cct gag gga gtg 
Arg Trp Gly Gin Glu Val lie Ser Gly Asn Glu Asp Pro Glu Gly Vai 
135 140 145 150 

gtg ctg aag gac ctt ggg ccc ccc atg gtt gec cga ctg gtt cgc ttc 
Val Leu Lys Asp Leu Gly Pro Pro Met Val Ala Arg Leu Val Arg Phe 
155 160 . 165 

tac ccc egg get gac egg gtc atg age gtc tgt ctg egg gta gag etc 
Tyr Pro Arg Ala Asp Arg Val Met Ser Val Cys Leu Arg Val III Leu 
170 175 iso 

tat ggc tgc etc tgg agg gat gga etc ctg. tct tac acc gec cct gtg 
Tyr Gly Cys Leu Trp Arg Asp Gly Leu Leu Ser Tyr Thr Ala Pro Val 
!85 190 195 

p? 9 nf 9 2? at9 tat tta tct gag * cc -gtg tac etc aac gac tec acc 
Gly Gin Thr Met Tyr Leu Ser Glu Ala Val Tyr Leu Asn Lp Ser Tnr 
200 205 210 

tat gac gga cat acc gtg ggc gga ctg cag tat ggg ggt ctg ggc cag 1026 
Tyr Asp Gly His Thr Val Gly Gly Leu Gin Tyr liy Hy Leu lly Gin 
215 220 225 230 

ctg gca gat ggt gtg gtg ggg ctg gat gac ttt agg aag agt cag gag 
Leu Ala Asp Gly Val Val Gly Leu Asp Asp Phe Arg Lys Ser Gin Glu 
235 240 



834 



882 



930 



978 



1074 



ill III 5*? t 99 n° a 9 ?° tat 9aC tat gtg 'gga tgg age aac cac age 1122 
Leu Arg Val Trp Pro Gly Tyr Asp Tyr Val Gly Trp Ser Asn His Ser 



ttc tec agt ggc tat gtg gag atg gag ttt gag ttt gac egg ctg agg 1170 
Phe Ser Ser Gly Tyr Val Glu Met Glu Phe Glu Phe Asp Arg Leu Arg 
265 270 275 

??° lt C n 39 atg ° ag gt ° Cac tgt aac aac at g cac acg ctg gga 1218 
Ala Phe Gin Ala Met Gin Val His Cys Asn Asn Met His Thr Leu Gly 
280 285 290 

gec cgt ctg cct ggc ggg gtg gaa tgt cgc ttc egg cgt ggc cct gec 1266 
Ala Arg Leu Pro Gly Gly Val Glu Cys Arg Phe Arg Arg Gly Pro Ala 
295 300 305 310 

£? III rf 9 r? 9 2? 9 l° C atg CgC ° aC 330 Cta ggg ggC aac ctg 1314 
Met Ala Trp Glu Gly Glu Pro Met Arg His Asn Leu Gly Gly Asn Leu 

315 320 325 



WO 2004/022778 



62 



PCT/AU2003/001166 



ggg gac ccc aga gcc egg get gtc tea gtg ccc ctt ggc ggc cgt gtg 1362 
Gly Asp Pro Arg Ala Arg Ala Val Ser Val Pro Leu Gly Gly Arg Val 
330 335 340 

get cgc ttt ctg cag tgc cgc ttc etc ttt gcg ggg ccc tgg tta etc 1410 
Ala Arg Phe Leu Gin Cys Arg Phe Leu Phe Ala Gly Pro Trp Leu Leu 
345 350 355 

ttc age gaa ate tec ttc ate tct gat gtg gtg aac aat tec tct ccg 1458 
Phe Ser Glu lie Ser Phe lie Ser Asp Val Val Asn Asn Ser Ser Pro 
360 365 . 370 

gca ctg gga ggc acc ttc ccg cca gcc ccc tgg tgg ccg cct ggc cca 1506 
Ala Leu Gly Gly Thr Phe Pro Pro Ala Pro Trp Trp Pro Pro Gly Pro 
' 375 380 385 390 

cct ccc acc aac ttc age age ttg gag ctg gag ccc aga ggc cag cag 1554 
Pro Pro Thr Asn Phe Ser Ser Leu Glu Leu Glu Pro Arg Gly Gin Gin 
395 400 ' 405 

ccc gtg gcc aag gcc gag ggg age ccg acc gcc ate etc ate ggc tgc 1602 
Pro Val Ala Lys Ala Glu Gly Ser Pro Thr Ala lie Leu He Gly Cys 
410 415 420 

ctg gtg gcc ate ate ctg etc ctg ctg etc ate att gcc etc atg etc 1650 
Leu Val Ala He He Leu Leu Leu Leu Leu He He Ala Leu Met Leu 
425 430 435 

tgg egg ctg cac tgg cgc agg etc etc age aag get gaa egg agg gtg 1698 
Trp Arg Leu His Trp Arg Arg Leu Leu Ser Lys Ala Glu Arg Arg Val 
440 445 450 



ttg gaa gag gag ctg acg gtt cac etc tct gtc cct ggg gac act ate 1746 
Leu Glu Glu Glu Leu Thr Val His Leu Ser Val Pro Gly Asp Thr He 
455 460 465 470 

etc ate aac aac cgc cca ggt cct aga gag cca ccc ccg tac cag gag 1794 
Leu He Asn Asn Arg Pro Gly Pro Arg :Glu .Pro Pro Pro Tyr Gin Glu 
475 480 485 

ccc egg cct cgt ggg aat ccg ccc cac tec get ccc tgt gtc ccc aat 1842 
Pro Arg Pro Arg Gly Asn Pro Pro His Ser Ala Pro Cys Val Pro Asn 
490 495 500 

ggc tct gcg ttg ctg etc tec aat cca gcc tac cgc etc ctt ctg gcc 1890 
Gly Ser Ala Leu Leu Leu Ser Asn Pro Ala Tyr Arg Leu Leu Leu Ala 
505 510 515 

act tac gcc cgt ccc cct cga ggc ccg ggc ccc ccc aca ccc gcc tgg 1938 
Thr Tyr Ala Arg Pro Pro Arg Gly Pro Gly Pro Pro Thr Pro Ala Trp 
520 525 530 

gcc aaa ccc acc aac acc cag gcc tac agt ggg gac tat atg gag cct 1986 
Ala Lys Pro Thr Asn Thr Gin Ala Tyr Ser Gly Asp Tyr Met Glu Pro 
535 540 545 550 

gag aag cca ggc gcc ccg ctt ctg ccc cca cct ccc cag aac age gtc 2034 
Glu Lys Pro Gly Ala Pro Leu Leu Pro Pro Pro Pro Gin Asn Ser Val 
555 560 565 

ccc cat tat gcc gag get gac att gtt acc ctg cag ggc gtc acc ggg 2082 
Pro His Tyr Ala Glu Ala Asp lie Val Thr Leu Gin Gly Val Thr Gly 
570 575 580 
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ggc aac acc tat get gtg cct gca ctg ccc cca ggg gC a gtc ggg gat 2130 
Gly Asn Thr Tyr Ala Val Pro Ala Leu Pro Pro Gly Ila Vai 61? Asp 
585 590 595 

ggg ccc ccc aga gtg gat ttc cct cga tct cga etc cgc ttc aag gag 2178 
Gly Pro Pro Arg Val Asp Phe Pro Arg Ser Arg Leu Arg Phe Lys Glu 
600 605 



610 



ggc gag ggc cag ttt ggg gag gtg cac ctg tgt gag gtc gac 2226 
Lys Leu Gly Glu Gly Gin Phe Gly Glu Val His Leu Cys Glu Val Asp 
615 620 625 630 

w p Ct ^ a ? at Ctg gtt a9t Ctt gat ttc ccc ctt aat «tg cgt aag 2274 
Ser Pro Gin Asp Leu Val Ser Leu Asp Phe Pro. Leu Asn Val Arg Lys 
635 640 64 1 

gga cac cct ttg ctg gta get gtc aag ate tta egg cca gat gee acc 2322 
Gly firs Pro Leu Leu Val Ala Val Lys lie Leu Arg Pro Asp lla Thr 
650 655 660 

aag aat gee age ttc tec ttg ttc tec agg aat gat ttc ctg aaa gag 
Lys Asn Ala Ser Phe Ser Leu Phe Ser Arg Asn Asp Phe Leu Lys Glu 
665 670 



675 



710 



740 



2370 



2418 



gtg aag ate atg teg agg etc aag gac cca aac ate att egg ctg ctg 
Val Lys He Met Ser Arg Leu Lys Asp Pro Asn He He Arg Leu Leu 
680 685 690 • 

G?v f? ^ g ? a ° gaC CCC Ct ° tgc atg att act g ac tac atg 2466 

Gly Val Cys Val Gin Asp Asp Pro Leu Cys Met He Thr Asp Tyr Met 
695 700 705 



2514 



gag aac ggc gac etc aac cag ttc etc agt gee cac cag ctg gag gac 
Glu Asn Gly Asp Leu Asn Gin Phe Leu Ser Ala His Gin Leu Glu Asp 
715 720 725 

III a?! J?! 2? g g f C ° Ct ggg gaC ggg Cag gct gcg ca * 9W *cc 2562 

Lys Ala Ala Glu Gly Ala Pro Gly Asp Gly Gin Ala Ala Gin Gly Pro 
730 735 



^ agc £ ac ^ ca at * ct 9 ct 9 ^t gtg gca gee cag ate gee tec 2610 
Thr He Ser Tyr Pro Met Leu Leu His Val Ala Ala Gin He Ala Ser 
745 750 755 

ggc atg cgc tat ctg gee aca etc aac ttt' gta cat egg gac ctg gee 2658 
Gly Met Arg Tyr Leu Ala Thr Leu Asn Phe Val His A?g Asp Leu Ma 
760 765 770 

acg egg aac tgc eta gtt ggg gaa aat ttc acc ate aaa ate gca gac 2706 
Thr Arg Asn Cys Leu Val Gly Glu Asn Phe Thr He Lys lie Ala Asp 
775 780 785 790 

Ph! r?w ttl c 9C ^ 9 aa ° CtC tat gCt ggg gac tat tac <* fc «tg cag 2754 
Phe Gly Met Ser Arg Asn Leu Tyr Ala Gly Asp Tyr Tyr Arg Val Gin 

7 95 800 805 

ggc egg gca gtg ctg ccc ate cgc tgg atg gee tgg gag tgc ate etc 2802 
Gly Arg Ala Val Leu Pro He Arg Trp Met Ala Trp Glu Cys He Leu 
810 815 820 

utl rtl til ph C ^ ^ g ? 9 agt ga ° tgg gcc tt:t ggt gtg acc 2850 
Met Gly Lys Phe Thr Thr Ala Ser Asp Val Trp Ala Phe Gly Val Thr 

825 830 835 



ctg tgg gag gtg ctg atg etc tgt agg gcc cag ccc ttt ggg tea gct 
Leu Trp Glu Val Leu Met Leu Cys Arg Ala Gin Pro Phe Gly Ser Ala 



2898 
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840 



845 



850 



cac cga cga gca ggt cat cga gaa cgc ggg gga gtt ctt ccg gga cca 2946 

His Arg Arg Ala Gly His Arg Glu Arg Gly Gly Val Leu Pro Gly Pro 
855 860 865 870 

ggg ccg gca gtg tac ctg tec egg- ccg cct gec tgc ccg cag ggc eta 2994 

Gly Pro Ala Val Tyr Leu Ser Arg Pro Pro Ala Cys Pro Gin Gly Leu 
875 880 885 

tat gag ctg atg ctt egg tgc tgg age egg gag tct gag cag cga cca . 3042 

Tyr Glu Leu Met Leu Arg Cys Trp Ser Arg Glu Ser Glu Gin Arg Pro 
890 895 900 

ccc ttt tec cag ctg cat egg ttc ctg gca gag gat gca etc aac acg 3090 

Pro Phe Ser Gin Leu His Arg Phe Leu Ala Glu Asp Ala Leu Asn Thr 

905 910 915 

gtg tgaatcacac atccagctgc ccctccctca gggagegate caggggaagc 3143 
Val 



cagtgacact 


aaaacaagag 


gacacaatgg 


cacctctgcc cttcccctcc cgacagccca 


3203 


tcacctctaa 


tagaggcagt 


gagactgeag 


gtgggctggg cccacccagg gagctgatgc 


3263 


cccttctccc 


cttcctggac 


acactctcat 


gtccccttcc tgttcttcct tcctagaagc 


3323 


ccccctgtcg 


cccacccagc 


tggtcctgtg 


gatgggatcc tctccaccct cctctagcca 


3383 


tcccttgggg 


aagggtgggg 


agaaatatag 


gatagacact ggacatggcc cattggagca 


3443 


cctgggcccc 


actggacaac 


actgattcct 


ggagaggtgg ctgcgccccc agcttctctc 


3503 


tccctgtcac 


acactggacc 


ccactggctg 


agaatctggg ggtgaggagg acaagaagga 


3563 


gaggaaaatg 


tttccttgtg 


cctgctcctg 


tacttgtcct cagcttgggc ttcttcctcc 


3623 


tccatcacct 


gaaacactgg 


acctgggggt 


agccccgccc cagccctcag tcacccccac 


3683 


ttcccacttg 


cagtcttgta 


gctagaactt 


ctctaagcct ataegtttet gtggagtaaa 


374-3 


tattgggatt 


ggggggaaag 


agggagcaac 


ggcccatagc cttggggttg gacatctcta 


3803 


gtgtagctgc 


cacattgatt 


tttctataat 


cacttggggt ttgtacattt ttggggggag 


3863 


agacacagat 


ttttacacta 


atatatggac 


ctagcttgag gcaattttaa tcccctgcac 


3923 


taggcaggta 


ataataaagg 


ttgagttttc 


cacaaaaaaa aaaaaaa 


3970 



<210> 28 

<211> 919 

<212> PRT 

<213> NM 013994 DDR1 



<400> 28 

Met Gly Pro Glu Ala Leu Ser Ser 

1 ' 5 



Leu Leu Leu Leu Leu Leu Val Ala 
10 15 



Ser" Gly Asp Ala Asp Met Lys Gly His Phe Asp Pro Ala Lys Cys Arg 
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20 25 



30 



Tyr Ala Leu Gly Met Gin Asp Arg ..Thr lie Pro Asp Ser Asp He Ser 
35 40 45 

Ala Ser Ser Ser Trp Ser Asp Ser Thr Ala Ala Arg His Ser Arg Leu 
50 55 .60 

Glu Ser Ser Asp Gly Asp Gly Ala Trp Cys Pro Ala Gly Ser Val Phe 
65 70 75 80 

Pro Lys Glu Glu Glu Tyr Leu Gin Val Asp Leu Gin Arg Leu His Leu 
85 go 95 

Val Ala Leu Val Gly Thr Gin Gly Arg His Ala Gly Gly Leu Gly Lys 
100 a05 no 

Glu Phe Ser Arg Ser Tyr Arg Leu Arg Tyr Ser Arg Asp Gly Arg Arg 
115 120 125 

Trp Met Gly Trp Lys Asp Arg Trp Gly Gin Glu Val He Ser Gly Asn 
1J0 135 140 

Glu Asp Pro Glu Gly Val Val Leu Lys Asp Leu Gly Pro Pro Met Val 
145 150 155 i 6 o 

Ala Arg Leu Val Arg Phe Tyr Pro Arg Ala Asp Arg Val Met Ser Val 
165 i7 Q 1?5 

Cys Leu Arg Val Glu Leu Tyr Gly Cys Leu Trp Arg Asp Gly Leu Leu 
I 80 185 190 

Ser Tyr Thr Ala Pro Val Gly Gin Thr Met Tyr Leu Ser Glu Ala Val 
. I 95 200 205 

Tyr Leu Asn Asp Ser Thr Tyr Asp Gly His Thr Val Gly Gly Leu Gin 
210 215 220 

Tyr Gly Gly Leu Gly Gin Leu Ala Asp Gly Val Val Gly Leu Asp Asp 
225 230 235 240 

Phe Arg Lys Ser Gin Glu Leu Arg Val Trp Pro Gly Tyr Asp Tyr Val 
245 250 255 

Gly Trp Ser Asn His Ser Phe Ser Ser Gly Tyr Val Glu Met Glu Phe 
260 265 270 

Glu Phe Asp Arg Leu Arg Ala Phe Gin Ala Met Gin Val His Cys Asn 
27 5 280 285 



Asn Met His Thr Leu Gly Ala Arg Leu Pro Gly Gly Val Glu Cys Arg 
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290 295 300 

Phe Arg Arg Gly Pro Ala Met Ala Trp Glu Gly Glu Pro Met Arg His 
305 310 315 320 

Asn Leu Gly Gly Asn Leu Gly Asp Pro; Arg Ala Arg Ala Val Ser Val 
325 330 335 

Pro Leu Gly Gly Arg Val Ala Arg Phe Leu Gin Cys Arg Phe Leu Phe 
340 345 350 

Ala Gly Pro Trp Leu Leu Phe Ser Glu lie Ser Phe lie Ser Asp Val 
355 360 365 

Val Asn Asn Ser Ser Pro Ala Leu Gly Gly Thr Phe Pro Pro Ala Pro 
370 375 380 

Trp Trp Pro Pro Gly Pro Pro Pro Thr Asn Phe Ser Ser Leu Glu Leu 
385 390 395 400 



Glu Pro Arg Gly Gin Gin Pro Val Ala Lys Ala Glu Gly . Ser Pro Thr 
405 410 415 



Ala lie Leu lie Gly Cys Leu Val Ala lie lie Leu Leu Leu Leu Leu 
420 425 ■ 430 



lie lie Ala Leu Met Leu Trp Arg Leu His Trp Arg Arg Leu Leu Ser 
435 440 445 



Lys Ala Glu Arg Arg Val Leu Glu Glu Glu Leu Thr Val His Leu Ser 
450 455 460 



Val Pro Gly Asp Thr He Leu He Asn Asn Arg Pro Gly Pro Arg Glu 
465 470 475 480 



Pro Pro Pro Tyr Gin Glu Pro Arg Pro Arg Gly Asn Pro Pro His Ser 
485 490 495 



Ala Pro Cys Val Pro Asn Gly Ser Ala Leu Leu Leu Ser Asn Pro Ala 
500 505 510 



Tyr Arg Leu Leu Leu Ala Thr Tyr Ala Arg Pro Pro Arg Gly Pro Gly 
515 520 525 



Pro Pro Thr Pro Ala Trp Ala Lys Pro Thr Asn Thr Gin Ala Tyr Ser 
530 535 540 



Gly Asp Tyr Met Glu Pro Glu Lys Pro Gly Ala Pro Leu Leu Pro Pro 
545 550 555 560 
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Pro Pro Gin Asn Ser Val "Pro His Tyr Ala Glu Ala Asp lie Val Thr 
565 • 570 575 

Leu Gin Gly Val Thr Gly Gly Asn Thr Tyr Ala Val Pro Ala Leu Pro 
580 585 590 

Pro Gly Ala Val Gly Asp Gly Pro Pro Arg Val Asp Phe Pro Arg Ser 
595 600 • 605 

Arg Leu Arg Phe Lys Glu Lys Leu Gly Glu Gly Gin Phe Gly Glu Val 
610 615 "* 620 

His Leu Cys Glu Val Asp Ser Pro Gin Asp Leu Val Ser Leu Asp Phe 
625 630 635 640 

Pro Leu Asn Val Arg Lys Gly His Pro Leu Leu Val Ala Val Lys He 
645 650 655 

Leu Arg Pro Asp Ala Thr Lys Asn Ala Ser Phe Ser Leu Phe Ser Arg 
660 665 670 

Asn Asp Phe Leu Lys Glu Val Lys He Met Ser Arg Leu Lys Asp Pro 
67 5 680 685 

Asn He He Arg Leu Leu Gly Val Cys Val Gin Asp Asp Pro Leu Cys 
690 695 700 

Met He Thr Asp Tyr Met Glu Asn Gly Asp Leu Asn Gin Phe Leu Ser 
705 710 715 720 

Ala His Gin Leu Glu Asp Lys Ala Ala Glu Gly Ala Pro Gly Asp Gly 
725 730 735 

Gin Ala Ala Gin Gly Pro Thr He Ser Tyr Pro Met Leu Leu His Val 
740 745 750 

Ala Ala Gin He Ala Ser Gly Met Arg Tyr Leu Ala Thr Leu Asn Phe 
755 760 765 

Val His Arg Asp Leu Ala Thr Arg Asn Cys Leu Val Gly Glu Asn Phe 
770 775 780 

Thr He Lys He Ala Asp Phe Gly Met Ser Arg Asn Leu Tyr Ala Gly 
785 790 795 800 

Asp Tyr Tyr Arg Val Gin Gly Arg Ala Val Leu Pro He Arg Trp Met 
80S 810 815 



Ala Trp Glu Cys He Leu Met Gly Lys Phe Thr Thr Ala Ser Asp Val 
820 825 830 
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Trp Ala Phe Gly Val Thr Leu Trp Glu Val Leu Met Leu Cys Arg Ala 
835 840 845 

Gin Pro Phe Gly Ser Ala His Arg Arg Ala Gly His Arg Glu Arg Gly 
850 855 ~ 860 

Gly Val Leu Pro Gly Pro Gly Pro Ala .Val Tyr Leu Ser Arg Pro Pro 
865 870. ■ 875 880 

Ala Cys Pro Gin Gly Leu Tyr Glu Leu Met Leu Arg Cys Trp Ser Arg 
885 890 " " 895 

Glu Ser Glu Gin Arg Pro Pro Phe Ser Gin Leu His Arg Phe Leu Ala 
900 905 910 



Glu Asp Ala Leu Asn Thr Val 
915 



<210> 29 

<211> 5698 

<212> DNA 

<213> NMJJ01067 TOP2A 



<220> 

<221> CDS 

<222> (127) (4719) 

<223> 



<400> 29 

aggttcaagt ggagctctcc taaccgacgc gcgtctgtgg agaagcggct tggtcggggg 60 

tggtctcgtg gggtcctgcc tgtttagtcg ctttcagggt tcttgagccc cttcacgacc 120 

gtcacc atg gaa gtg tea cca ttg cag cct gta aat gaa aat atg caa 
Met Glu Val Ser Pro Leu Gin Pro Val Asn Glu Asn Met Gin 
1 5 10 



168 



gtc aac aaa ata aag aaa aat gaa gat get aag aaa aga ctg tct gtt 216 
Val Asn Lys He Lys Lys Asn Glu Asp Ala Lys Lys Arg Leu Ser Val 
15 20 25 " 30 

gaa aga ate tat caa aag aaa aca caa ttg gaa cat att ttg etc cgc 264 
Glu Arg He Tyr Gin Lys Lys Thr Gin Leu Glu His He Leu Leu Arg 
35 40 45 



cca gac acc tac att ggt tct gtg gaa tta gtg acc cag caa atg tgg 312 
Pro Asp Thr Tyr He Gly Ser Val Glu Leu Val Thr Gin Gin Met Trp 
50 55 60 

gtt tac gat gaa gat gtt ggc att aac tat agg gaa gtc act ttt gtt . 360 
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Val Tyr Asp Glu Asp Val Gly lie Asn Tyr Arg Glu Val Thr Phe Val 
65 70 75 



cct ggt ttg tac aaa ate ttt gat gag att eta gtt aat get gcg gac 
Pro Gly Leu Tyr Lys He Phe Asp Glu He Leu Val Asn Ala Ala Asp 
80 85 90 



atg agt gaa aaa ggc ttt cag caa att age ttt gtc aac age att get 
Met Ser Glu Lys Gly Phe Gin Gin He Ser Phe Val Asn Ser He Ala 
305 310 315 



408 



aac aaa caa agg gac cca aaa atg tct tgt att aga gtc aca att gat 456 
Asn Lys Gin Arg Asp Pro Lys Met Ser Cys He Arg Val Thr He Asp 
95 100 105 no 

ccg gaa aac aat tta att agt ata tgg aat aat gga aaa ggt att cct 504 
Pro Glu Asn Asn Leu He Ser He Trp Asn Asn Gly Lys Gly He Pro 
115 120 125 

gtt gtt gaa cac aaa gtt gaa aag atg tat gtc cca get etc ata ttt 552 
Val Val Glu His Lys Val Glu Lys Met Tyr Val Pro Ala Leu He Phe 
130 135 140 

gga cag etc eta act tct agt aac tat gat gat gat gaa aag aaa gtg 600 
Gly Gin Leu Leu Thr Ser Ser Asn Tyr Asp Asp Asp Glu Lys Lys Val 
145 iso 155 

aca ggt ggt cga aat ggc tat gga gec aaa ttg tgt aac ata ttc agt 648 
Thr Gly Gly Arg Asn Gly Tyr Gly Ala Lys Leu Cys Asn He Phe Ser 
16 ° 165 170 

ace aaa ttt act gtg gaa aca gee agt aga gaa tac aag aaa atg ttc 696 
Thr Lys Phe Thr Val Glu Thr Ala Ser Arg Glu Tyr Lys Lys Met Phe 
175 180 185 * 190 

aaa cag aca tgg atg gat aat atg gga aga get ggt gag atg gaa etc 744 
Lys Gin Thr Trp Met Asp Asn Met Gly Arg Ala Gly Glu Met Glu Leu 
195 200 205 

aag ccc ttc aat gga gaa gat tat aca tgt ate ace ttt cag cct gat 792 
Lys Pro Phe Asn Gly Glu Asp Tyr Thr Cys He Thr Phe Gin Pro Asp 
210 215 220 

ttg tct aag ttt aaa atg caa age ctg gac aaa gat att gtt gca eta 840 
Leu Ser Lys Phe Lys Met Gin Ser Leu Asp Lys Asp He Val Ala Leu 
225 230 235 

atg gtc aga aga gca tat gat att get gga tec ace aaa gat gtc aaa 888 
Met Val Arg Arg Ala Tyr Asp He Ala Gly Ser Thr Lys Asp Val Lys 
240 245 250 

gtc ttt ctt aat gga aat aaa ctg cca gta aaa gga ttt cgt agt tat 936 
Val Phe Leu Asn Gly Asn Lys Leu Pro Val Lys Gly Phe Arg Ser Tyr 
255 260 265 270 

gtg gac atg tat ttg aag gac aag ttg gat gaa act ggt aac tec ttg 984 
Val Asp Met Tyr Leu Lys Asp Lys Leu Asp Glu Thr Gly Asn Ser Leu 
27 5 280 285 

aaa gta ata cat gaa caa gta aac cac agg tgg gaa gtg tgt tta act 1032 
Lys Val He His Glu Gin Val Asn His Arg Trp Glu Val Cys Leu Thr 
290 295 300 



1080 



aca tec aag ggt ggc aga cat gtt gat tat gta get gat cag att gtg 
Thr Ser Lys Gly Gly Arg His Val Asp Tyr Val Ala Asp Gin He Val 
320 325 330 



1128 
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act aaa ctt gtt gat gtt gtg aag aag aag aac aag ggt ggt gtt gca 
Thr Lys Leu Val Asp Val Val Lys Lys Lys Asn Lys Gly Gly Val Ala 
335 340 345 350 

gta aaa gca cat cag gtg aaa aat cac atg tgg att ttt gta aat gcc 
Val Lys Ala His Gin Val Lys Asn His Met Trp He Phe Val Asn Ala 
355 . 360 365 

tta att gaa aac cca acc ttt gac tct cag aca aaa gaa aac atg act 
Leu He Glu Asn Pro Thr Phe Asp Ser Gin Thr Lys Glu Asn Met Thr 
370 375 - 380 

tta caa ccc aag age ttt gga tea aca tgc caa ttg agt gaa aaa ttt 
Leu Gin Pro Lys Ser Phe Gly Ser Thr Cys Gin Leu Ser Glu Lys Phe 
385 390 395 

ate aaa get gcc att ggc tgt ggt att gta gaa age ata eta aac tgg 
He Lys Ala Ala He Gly Cys Gly He Val Glu Ser He Leu Asn Trp 
400 405 410 

gtg aag ttt aag gcc caa gtc cag tta aac aag aag tgt tea get gta 
Val Lys Phe Lys Ala Gin Val- Gin Leu Asn Lys Lys Cys Ser Ala Val 
415 420 



425 430 



aaa cat aat aga ate aag gga att ccc aaa etc gat gat gcc aat gat 
Lys His Asn Arg He Lys Gly He Pro Lys Leu Asp Asp Ala Asn Asp 
435 440 445 

gca ggg ggc cga aac tec act gag tgt acg ctt ate ctg act gag gga 
Ala Gly Gly Arg Asn Ser Thr Glu Cys Thr Leu He Leu Thr Glu Glv 
450 455 460 

gat tea gcc aaa act ttg get gtt tea ggc ctt ggt gtg gtt ggg aga 
Asp Ser Ala Lys Thr Leu Ala Val Ser Gly Leu Gly Val Val Gly Arg 
465 470 475 

gac aaa tat ggg gtt ttc cct ctt aga gga aaa ata etc aat gtt cga 
Asp Lys Tyr Gly Val Phe Pro Leu Arg Gly Lys He Leu Asn Val Arg 
480 485 



490 



I °f aag Cag atc atg gaa a *t.gct gag att aac aat ate 
Glu Ala Ser His Lys Gin He Met Glu. Asn Ala Glu He Asn Asn He 
495 500 505 510 

atc aag att gtg ggt ctt cag tac aag aaa aac tat gaa gat gaa gat 
He Lys He Val Gly Leu Gin Tyr Lys Lys Asn Tyr Glu Asp Glu Asp 
515 520 525 

tea ttg aag acg ctt cgt tat ggg aag . ata atg att atg aca gat cag 
Ser Leu Lys Thr Leu Arg Tyr. Gly Lys He Met He Met Thr Asp Gin 
530 535 540 

gac caa gat ggt tec cac atc aaa ggc ttg ctg att aat ttt atc cat 
Asp Gin Asp Gly Ser His He Lys Gly Leu Leu He Asn Phe lie His 
545 550 555 



cac aac tgg ccc tct ctt ctg cga cat cgt ttt ctg gag gaa ttt atc 
His Asn Trp Pro Ser Leu Leu Arg His Arg Phe Leu Glu Glu Phe He 
560 565 570 

act ccc att gta aag gta tct aaa aac aag caa gaa atg gca ttt tac 
Thr Pro He Val Lys Val Ser Lys Asn Lys Gin Glu Met Ala Phe Tyr 
575 580 585 590 

age ctt cct gaa ttt gaa gag tgg aag agt tct- act cca aat cat aaa 



1176 
1224 
1272 
1320 
1368 
1416 

1464 

1512 

1560 

1608 

1656 

1704 

1752 

1800 

1848 
1896 
1944 
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Ser Leu Pro Glu Phe Glu Glu Trp Lys Ser Ser Thr Pro Asn His Lys 
595 600 • 605 

aaa t^gg aaa gtc aaa tat tac aaa ggt ttg ggc acc age aca tea aag 
Lys Trp Lys Val Lys Tyr Tyr Lys Gly Leu Gly Thr Ser Thr Ser Lys 
610 615 6 20 

gaa get aaa gaa tac ttt gca gat atg aaa aga cat cgt ate cag ttc 
Glu Ala Lys Glu Tyr Phe Ala Asp Met Lys Arg His Arg lie Gin Phe 
625 630 635 

aaa tat tct ggt cct gaa gat gat get get ate age ctg gee ttt age 
Lys Tyr Ser Gly Pro Glu Asp Asp Ala Ala lie Ser Leu Ala Phe Ser 
640 645 '650 

f^! ^ 9 9at ° ga aag gaa tgg tta act aat ttc ^g gag 2136 

Lys Lys Gin He Asp Asp Arg Lys Glu. Trp Leu Thr Asn Phe Met Glu ' 

655 660 • 665 670 

gat aga aga caa cga aag tta ctt ggg ctt cct gag gat tac ttg tat 
Asp Arg Arg Gin Arg Lys Leu Leu Gly Leu Pro Glu Asp Tyr Leu Tyr 
675 680 



685 



ril mu a . C a ^ a tat ctg aca tat aat gac ttc ate aac aag gaa 
Gly Gin Thr Thr Thr Tyr Leu Thr Tyr Asn Asp Phe lie Asa Lys Glu 
690 695 



700 



745 750 



765 



770 775 



780 



etc ttg cag ccc att ggt cag ttt ggt acc agg eta cat ggt ggc aag 
Leu Leu Gin Pro lie Gly Gin Phe Gly Thr Arg Leu His Gly Gly Lys 
785 790 795 

gat tct get agt cca cga tac ate ttt aca atg etc age tct ttg get 
Asp Ser Ala Ser Pro Arg Tyr He Phe Thr Met Leu Ser Ser Leu Ala 
8 <>0 805 810 

cga ttg tta ttt cca cca aaa gat gat cac acg ttg aag ttt tta tat 
Arg Leu Leu Phe Pro Pro Lys Asp Asp His Thr Leu Lys Phe Leu Tyr 
815 820 825 830 

gat gac aac cag cgt gtt gag cct gaa tgg tac att cct att att ccc 
Asp Asp Asn Gin Arg Val Glu Pro Glu Trp Tyr He Pro He He Pro 
835 840 845 



1992 



2040 



2088 



2184 



2232 



ill t 5 ° tCa aat tct gat aac W a * a tct a tc cct tct atg 
Leu He Leu Phe Ser Asn Ser Asp Asn Glu Arg Ser He Pro Ser Met 
705 710 • 715 

v!? f at 2? fc x t9 aaa CCa ggt cag aga aa 9 *tt ttg ttt act tgc ttc 
Val Asp Gly Leu Lys Pro Gly Gin Arg Lys Val Leu Phe Thr Cys Phe 
720 725 730 

til til ^ f° ? ag ° 9a 9aa 9ta aa 9"9tt * cc c *a tta get gga tea 2376 
Lys Arg Asn Asp Lys Arg Glu Val Lys Val Ala Gin Leu Ala Gly Ser 
735 740 



2280 



2328 



Va? Ill c Ct tCt tat Cat Cat ggt gag atg tCa cta a ^g atg 2424 

Val Ala Glu Met Ser Ser Tyr His His Gly Glu Met Ser Leu Met Met 
755 760 



?hr ?E T^o aat T ttg 9 f Cag aat fctt gtg ggt agc aat aat cta a ac 2472 
Thr He He Asn Leu Ala Gin Asn Phe Val Gly Ser Asn Asn Leu Asn 



2520 



2568 



2616 



2664 



2712 
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aaa ate ccc aac ttt gat gtg cgt gaa att gta aat aac ate agg cgt 
Lys He Pro Asn Phe Asp Val Arg Glu He Val Asn Asn He Arg Arg 
865 870 875 



2760 



ttg atg gat gga gaa gaa cct ttg cca atg ctt cca agt tac aag aac 
Leu Met Asp Gly Glu Glu Pro Leu Pro Met Leu Pro Ser Tyr Lys Asn 
- 880 885 890 



2808 



ttc aag ggt act att gaa gaa ctg get cca aat caa tat gtg att agt 
Phe Lys Gly Thr He Glu Glu Leu Ala Pro Asn Gin Tyr Val He Ser 
895 900 905 910 



2856 



ggt gaa gta get att ctt aat tct aca ace att gaa ate tea gag ctt 
Gly Glu Val Ala lie Leu Asn Ser Thr Thr He Glu lie Ser Glu Leu 
915 920. 925 



2904 



ccc gtc aga aca tgg ace cag aca tac aaa gaa caa gtt eta gaa ccc 
Pro Val Arg Thr Trp Thr Gin Thr Tyr Lys Glu Gin. Val Leu Glu Pro 
930 935 940 



2952 



atg ttg aat ggc acc gag aag aca cct cct etc ata aca gac tat agg 
Met Leu Asn Gly Thr Glu Lys Thr Pro Pro Leu He Thr Asp Tyr Arg 
945 950 955 



3000 



gaa tac cat aca gat acc act gtg aaa ttt gtt gtg aag atg act gaa 
Glu Tyr His Thr Asp Thr Thr Val Lys Phe Val Val Lys Met Thr Glu 
960 965 970 



3048 



gaa aaa ctg gca gag gca gag aga gtt gga eta cac aaa gtc ttc aaa 
Glu Lys Leu Ala Glu Ala Glu Arg Val Gly Leu His Lys Val Phe Lys 
975 980 985 990 



3096 



etc caa act agt etc aca tgc aac tct atg gtg ctt ttt gac cac gta 
Leu Gin Thr Ser Leu Thr Cys Asn Ser Met Val Leu Phe Asp His Val 
995 1000 1005 



3144 



ggc tgt tta aag aaa tat gac acg gtg ttg gat att eta aga gac 
Gly Cys Leu Lys Lys Tyr Asp Thr Val Leu Asp He Leu Arg Asp 
1010 1015 1020 



3189 



ttt ttt gaa etc aga ctt aaa tat tat gga tta aga aaa gaa tgg 
Phe Phe Glu Leu Arg Leu Lys Tyr .Tyr Gly Leu Arg Lys Glu Trp 
1025 1030 " 1035 



3234 



etc eta gga atg ctt ggt get gaa tct get aaa ctg aat aat cag 
Leu Leu Gly Met Leu Gly Ala Glu Ser Ala Lys Leu Asn Asn Gin 
1040 1045 1050 



3279 



get cgc ttt ate tta gag aaa ata gat ggc aaa ata ate att gaa 
Ala Arg Phe He Leu Glu Lys He Asp Gly Lys He He He Glu 
1055 1060 1065 



3324 



aat aag cct aag aaa gaa tta att aaa gtt ctg att cag agg gga 
Asn Lys Pro Lys Lys Glu Leu He Lys Val Leu He Gin Arg Gly 
1070 1075 1080 



3369 



tat gat teg gat cct gtg aag gee tgg aaa gaa gee cag caa aag 
Tyr Asp Ser Asp Pro Val Lys Ala Trp Lys Glu Ala Gin Gin Lys 
1085 1090 1095 



3414 



gtt cca gat gaa gaa gaa aat gaa gag agt gac aac gaa aag gaa 
Val Pro Asp Glu Glu Glu Asn Glu Glu Ser Asp Asn Glu Lys Glu 
1100 1105 1110 



3459 



act gaa aag agt gac tec gta aca gat tct gga cca acc ttc aac 



3504 



WO 2004/022778 



73 



PCT/AU2003/001166 



Thr Glu Lys Ser Asp Ser Val Thr- Asp. Ser Gly Pro Thr Phe Asn 
1115 1120 1125 

tat ctt ctt gat atg ccc ctt tgg tat tta "acc aag gaa aag aaa 
Tyr Leu Leu Asp Met Pro Leu Trp Tyr Leu Thr Lys Glu Lys Lys 
H30 H35 H40 

gat gaa etc tgc agg eta aga aat gaa aaa gaa caa gag ctg gac 
Asp Glu Leu Cys Arg Leu Arg Asn Glu Lys Glu Gin Glu Leu Asp 
H45 1150 H55 

aca tta aaa aga aag agt cca tea gat ttg tgg aaa gaa gac ttg 
Thr Leu Lys Arg Lys Ser Pro Ser Asp Leu Trp Lys Glu Asp Leu 
1160 1165. H70 

get aca ttt att gaa gaa ttg gag get gtt gaa gec aag gaa aaa 
Ala Thr Phe He Glu Glu Leu Glu Ala Val Glu Ala Lys Glu Lys 
1175 1180 , H85 

caa gat gaa caa gtc gga ctt cct ggg aaa ggg ggg aag gee aag 
Gin Asp Glu Gin Val Gly Leu Pro Gly Lys Gly Gly Lys Ala Lys 
1190 H95 . 1200 

ggg aaa aaa aca caa atg get gaa gtt ttg cct tct ccg cgt ggt 
Gly Lys Lys Thr Gin Met Ala Glu Val Leu Pro Ser Pro Arg Gly 
1205 1210 1215 

caa aga gtc att cca cga ata acc ata gaa atg aaa gca gag gca 
Gin Arg Val lie Pro Arg lie Thr He Glu Met Lys Ala Glu Ala 
1220 1225 1230 

gaa aag aaa aat aaa aag aaa att aag aat gaa aat act gaa gga 
Glu Lys Lys Asn Lys Lys Lys lie Lys Asn Glu Asn Thr Glu Gly 
1235 1240 1245 

age cct caa gaa gat ggt gtg gaa eta gaa ggc eta aaa caa aga 
Ser Pro Gin Glu Asp Gly Val Glu Leu Glu Gly Leu Lys Gin Arg 
1250 1255 1260 

tta gaa aag aaa cag aaa aga gaa cca' ggt aca aag aca aag aaa 
Leu Glu Lys Lys Gin Lys Arg Glu Pro Gly Thr Lys Thr Lys Lys 
1265 1270 1275 

caa act aca ttg gca ttt aag cca ate aaa aaa gga aag aag aga 
Gin Thr Thr Leu Ala Phe Lys Pro lie Lys Lys Gly Lys Lys Arg 
1280 1285 1290 

aat ccc tgg tct gat tea gaa tea gat agg age agt gac gaa agt 
Asn Pro Trp Ser Asp Ser Glu Ser Asp Arg Ser Ser Asp Glu Ser 
1295 1300 1305 



aat ttt gat gtc cct cca cga gaa aca gag cca egg aga gca gca 
Asn Phe Asp Val Pro Pro Arg Glu Thr Glu Pro Arg Arg Ala Ala 
131° 1315 " 1320 

aca aaa aca aaa ttc aca atg gat ttg gat tea gat gaa gat ttc 
Thr Lys Thr Lys Phe Thr Met Asp Leu Asp Ser Asp Glu Asp Phe 
1325 1330 1335 

tea gat ttt gat gaa aaa act gat gat gaa gat ttt gtc cca tea 
Ser Asp Phe Asp Glu Lys Thr Asp Asp Glu Asp Phe Val Pro Ser 
1340 1345 1350 

gat get agt cca cct aag acc aaa act tec cca aaa ctt agt aac 
Asp Ala Ser Pro Pro Lys Thr Lys Thr Ser . Pro Lys Leu Ser Asn 
1355 1360 1365 



3549 
3594 
3639 
3684 

3729 
3774 
3819 
3864 
3909 
3954 
3999 
4044 
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4134 
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aaa gaa ctg aaa cca cag aaa agt gtc gtg tea gac ctt gaa get 
Lys Glu Leu Lys Pro Gin Lys 'Ser Val Val Ser Asp Leu Glu Ala 
1370 1375 ■. 1380 



4269 



gat gat gtt aag ggc agt gta cca ctg tct tea age cct cct get 
Asp Asp Val Lys Gly Ser Val Pro Leu Ser Ser Ser Pro Pro Ala 
1385 1390 1395 



4314 



aca cat ttc cca gat gaa act gaa att aca aac cca gtt cct aaa 
Thr His Phe Pro Asp Glu Thr Glu lie Thr Asn. Pro Val Pro Lys 
1400 1405 1410 



4359 



aag aat gtg aca gtg aag aag aca gca gca aaa agt cag tct tec 
Lys Asn Val Thr Val Lys Lys Thr Ala Ala Lys Ser Gin Ser Ser 
1415 1420 1425 



4404 



acc tec act ace ggt gee aaa aaa 
Thr Ser Thr Thr Gly Ala Lys Lys 
1430 



agg get gee cca aaa gga act 
Arg Ala Ala Pro Lys Gly Thr 
1435 1440 



4449 



aaa agg gat cca get ttg aat tct 
Lys Arg Asp Pro Ala Leu Asn Ser 
1445 



ggt gtc tct caa aag cct gat 
Gly Val Ser Gin Lys Pro Asp 
1450 . 1455 



4494 



cct gec aaa acc aag aat cgc cgc 
Pro Ala Lys Thr Lys Asn Arg Arg 
1460 



aaa agg aag cca tec act tct 
Lys Arg Lys Pro Ser Thr Ser 
1465 1470 



4539 



gat gat tct gac tct aat ttt gag 
Asp Asp Ser Asp Ser Asn Phe Glu 
1475 



aaa att gtt teg aaa gca gtc 
Lys He Val Ser Lys Ala Val 
1480 1485 



4584 



aca age aag aaa tec aag ggg gag 
Thr Ser Lys Lys Ser Lys Gly Glu 
1490 



agt gat gac ttc cat atg gac 
Ser" Asp Asp Phe His Met Asp 
1495 1500 



4629 



ttt gac tea get gtg get cct egg 
Phe Asp Ser Ala Val Ala Pro Arg 
.1505 


gca aaa tct gta egg gca aag 
Ala Lys Ser Val Arg Ala Lys 
1510 1515 


4674 


aaa cct ata aag tac ctg gaa gag 
Lys Pro He Lys Tyr Leu Glu Glu 
1520 


tea gat gaa gat gat ctg ttt 
Ser Asp Glu Asp Asp Leu Phe 
1525 1530 


4719 


taaaatgtga 


ggegattatt 


ttaagtaatt 


atcttaccaa 


gcccaagact 


ggttttaaag 


. 4779 


ttacctgaag 


ctcttaactt 


cctcccctct 


gaatttagtt 


tggggaaggt 


gtttttagta 


4839 


caagacatca 


aagtgaagta 


aagcccaagt 


gttctttagc 


tttttataat 


actgtctaaa 


4899 


tagtgaccat 


ctcatgggca 


ttgttttctt 


etctgetttg 


tctgtgtttt 


gagtctgett 


4959 


tcttttgtct 


ttaaaacctg 


atttttaagt 


tcttctgaac 


tgtagaaata 


gctatctgat 


5019 


cacttcagcg 


taaagcagtg 


tgtttattaa 


ccatccacta 


agctaaaact 


agagcagttt 


5079 


gatttaaaag 


tgtcactctt 


cctccttttc 


tactttcagt 


agatatgaga 


tagagcataa 


5139 


ttatctgttt 


tatcttagtt 


ttatacataa 


tttaccatca 


gatagaactt 


tatggttcta 


5199 


gtacagatac 


tctactacac 


tcagcctctt 


atgtgccaag 


tttttcttta 


agcaatgaga 


5259 


aattgetcat 


gttcttcatc 


ttctcaaatc 


ateagaggee 


aaagaaaaac 


actttggctg 


5319 


tgtctataac 


ttgacacagt 


caatagaatg 


aagaaaatta 


gagtagttat 


gtgattattt 


5379 
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cagctcttga cctgtcccct ctggctgcct ctgagtctga atctcccaaa gagagaaacc 5439 

aa'tttctaag aggactggat tgcagaagac tcggggacaa catttgatcc aagatcttaa 5499 

atgttatatt gataaccatg ctcagcaatg agctattaga ttcattttgg gaaatctcca 5559 

taatttcaat ttgtaaactt tgttaagacc tgtctacatt gttatatgtg tgtgacttga 5619 

gtaatgttat caacgttttt gtaaatattt actatgtttt tctattagct aaattccaac 5679 

aattttgtac tttaataaa 5698 



<210> 


30 


<211> 


1531 


<212> 


PRT 


<213> 


NM 001067 TOP2A 



<400>< 30 

Met Glu Val Ser Pro Leu Gin Pro Val Asn Glu Asn Met Gin Val Asn 
1 5 10' 15 



.Lys He Lys Lys Asn Glu Asp Ala Lys Lys Arg Leu Ser Val Glu Arg 
20 25 30 



He Tyr Gin Lys Lys Thr Gin Leu Glu His He Leu Leu Arg Pro Asp 
35 40 45 



Thr Tyr He Gly Ser Val Glu Leu Val Thr Gin Gin Met Trp Val Tyr 
50 55 60 



Asp Glu Asp Val Gly He Asn Tyr Arg Glu Val Thr Phe Val Pro Gly 
65 70 75 80 



Leu Tyr Lys He Phe Asp Glu He Leu Val Asn Ala Ala Asp Asn Lys 
85 90 95 



Gin Arg Asp Pro Lys Met Ser Cys He Arg Val Thr He Asp Pro Glu 
100 105 HO 



Asn Asn Leu He Ser He Trp Asn Asn Gly Lys Gly He Pro Val Val 
115 120 125 



Glu His Lys Val Glu Lys Met Tyr Val Pro Ala Leu He Phe Gly Gin 
130 135 140 



Leu Leu Thr Ser Ser Asn Tyr Asp Asp Asp Glu Lys Lys Val Thr Gly 
145 150 155 160 



Gly Arg Asn Gly Tyr Gly Ala Lys Leu Cys Asn He Phe Ser Thr Lys 
165 170 175 
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Phe Thr Val Glu Thr Ala Ser Arg Glu Tyr Lys Lys Met Phe Lys Gin 
180 185 190 



Thr Trp Me.t Asp Asn Met Gly Arg Ala Gly Glu Met Glu Leu Lys Pro 
195 200 205 



Phe Asn Gly Glu Asp Tyr Thr Cys He Thr Phe Gin Pro Asp Leu Ser 
210 215 220 



Lys Phe Lys Met Gin Ser Leu Asp Lys Asp He Val Ala Leu Met Val 
225 230 235 240 



Arg Arg Ala Tyr Asp He Ala Gly Ser Thr Lys Asp Val Lys Val Phe 
245 250 255 



Leu Asn Gly Asn Lys Leu Pro Val Lys Gly Phe Arg Ser Tyr Val Asp 
260 265 " " 210 



Met Tyr Leu Lys Asp Lys Leu Asp Glu Thr Gly Asn Ser Leu Lys Val 
275 280 285 



He His Glu Gin Val Asn His Arg Trp Glu Val Cys Leu Thr Met Ser 
290 295 300 



Glu Lys Gly Phe Gin Gin He Ser Phe Val Asn Ser He Ala Thr Ser 
305 310 315 320 



Lys Gly Gly Arg His Val Asp Tyr Val Ala Asp Gin He Val Thr Lys 
325 330 335 



Leu Val Asp Val Val Lys Lys Lys Asn Lys Gly Gly Val Ala Val Lys 
340 345 " 350 



Ala His Gin Val Lys Asn His Met Trp He Phe Val Asn Ala Leu He 
355 360 365 



Glu Asn Pro Thr Phe Asp Ser Gin Thr Lys Glu Asn Met Thr Leu Gin 
370 375 380 



Pro Lys Ser Phe Gly Ser Thr Cys Gin Leu Ser Glu Lys Phe He Lys 
385 390 395 400 



Ala Ala He Gly Cys Gly He Val Glu Ser He Leu Asn Trp Val Lys 
405 410 415 



Phe Lys Ala Gin Val Gin Leu Asn Lys Lys Cys Ser Ala Val Lys His 
420 425 " 430 



Asn Arg He Lys Gly He Pro Lys Leu Asp Asp Ala Asn Asp Ala Gly 
435 440 445 



\ ) } 
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Gly Arg Asn Ser Thr Glu Cys Thr Leu lie Leu Thr Glu Gly Asp Ser 
450 AtTr 



455 



460 



Ala Lys Thr Leu Ala Val Ser Gly Leu Gly Val Val Gly Arg Asp Lys 
465 470 475 480 

Tyr Gly Val Phe Pro Leu Arg Gly Lys lie Leu Asn Val Arg Glu Ala 
485 490 495 



Ser His Lys Gin He Met Glu Asn Ala Glu He Asn Asn He He Lys 
500 505 



510 



He Val Gly Leu Gin Tyr Lys Lys Asn Tyr Glu Asp Glu Asp Ser Leu 
515 .520 



525 



LyS So Tyr GlY — Ile Met Ile Met Thr Gln Asp Gln 



535 



540 



Asp Gly Ser His lie Lys Gly Leu Leu lie Asn Phe Ile His His Asn 
545 550 555 560 

Trp Pro Ser Leu Leu Arg His Arg Phe Leu Glu Glu Phe He Thr Pro 
565 570 575 



He Val Lys Val Ser Lys Asn Lys Gln Glu Met Ala Phe Tyr Ser Leu 
580 585 5 5 o 




Asp Asp Arg Lys Glu Trp Leu Thr Asn Phe Met Glu Asp Ara 
66 <> 665 670 



Arg Gln Arg Lys Leu Leu Gly Leu Pro Glu Asp Tyr Leu Tyr Gly Gln 
675 680 685 



Thr Thr Thr Tyr Leu Thr Tyr Asn Asp Phe Ile Asn Lys Glu Leu lie 
690 695 700 



Leu Phe Ser Asn Ser Asp Asn Glu Arg Ser He Pro Ser Met Val Asp 



O } 
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705 710 • 715 



720 



Gly Leu Lys Pro Gly Gin Arg Lys Val Leu Phe Thr Cys Phe Lys Arg 
725 730 735 

Asn Asp Lys Arg Glu Val Lys Val Ala Gin Leu Ala Gly Ser Val Ala 
740 745 750 

Glu Met Ser Ser Tyr His His Gly Glu Met Ser Leu Met Met Thr He 
755 760 765 

He Asn Leu Ala Gin Asn Phe Val Gly Ser Asn Asn Leu Asn Leu Leu 
770 775 780 

Gin Pro He Gly Gin Phe Gly Thr Arg Leu His Gly Gly Lys Asp Ser 
785 7 ^0 795 ' " 800 

Ala Ser Pro Arg Tyr He Phe Thr Met Leu Ser Ser Leu Ala Arg Leu 
805 810 815 

Leu Phe Pro Pro Lys Asp Asp His Thr Leu Lys Phe Leu Tyr Asp Asp 
820 825 830 

Asn Gin Arg Val Glu Pro Glu Trp Tyr He Pro He He Pro Met Val 
835 840 845 

Leu He Asn Gly Ala Glu Gly He Gly Thr Gly Trp Ser Cys Lys He 
850 855 860 

' Pro Asn Phe Asp Val Arg Glu He Val Asn Asn He Arg Arg Leu Met 
865 87 <> 875 " 880 

Asp Gly Glu Glu Pro Leu Pro Met Leu Pro Ser Tyr Lys Asn Phe Lys 
885 890 895 

Gly Thr He Glu Glu Leu Ala Pro Asn Gin Tyr Val He Ser Gly Glu 
900 905 910 

Val Ala He Leu Asn Ser Thr Thr He Glu He Ser Glu Leu Pro Val 
915 920 925 

Arg Thr Trp Thr Gin Thr Tyr Lys Glu Gin Val Leu Glu Pro Met Leu 
930 935 940 

Asn Gly Thr Glu Lys Thr Pro Pro Leu He Thr Asp Tyr Arq Glu Tvr 
945 950 955 960 

His Thr Asp Thr Thr Val Lys Phe Val Val Lys Met Thr Glu Glu Lys 
965 970 975 



Leu Ala Glu Ala Glu Arg Val Gly Leu His Lys Val Phe Lys Leu Gin 
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980 985 



990 



Thr Ser Leu Thr Cys Asn Ser Met Val Leu Phe Asp His Val Gly Cys 
995 • 1000 1005 

LeU ^, S „ LyS Tyr Val Leu Ile Le " Arg Asp Phe Phe 

1010 10 i5 1020 

G1U S £ *** LyS Tyr Tyr Gly ^ Arg Lys Glu Trp Leu Leu 
1025 1030 xo35 

Gly ^ Gly Ala Glu Ser Ala fl sn Asn Gin Ala Arg 

1040 1045 1050 

Phe lie Leu Glu Lys He Asp Gly Lys Ile Ile Ile Glu Asn Lys 
1055 io60 1065 

Pr ° ^„ LyS G1U LeU Ile Lys Val leu Ile Gln Arg Gly Tyr Asp ' 
1070 1075 1080 

SSr Pr ° Val LyS Ala Trp L * s Glu Ala G m SI" 1*3 Val Pro 

1085 1090 1095 

Asp Glu Glu Glu Asn Glu Glu Ser Asp Asn Glu Lys Glu Thr Glu 
1100 H05 mo 

Lys Ser Asp Ser Val Thr Asp Ser Gly Pro Thr Phe Asn Tyr Leu 
1H5 H20 1125 

Leu Asp Met Pro Leu Trp Tyr Leu Thr Lys Glu Lys Lys Asp Glu 
1130 H35 U40 

Leu Cys Arg Leu Arg Asn Glu Lys Glu Gin Glu Leu Asp Thr Leu 
H45 U50 U55 

Lys Arg Lys Ser Pro Ser Asp Leu Trp Lys Glu Asp Leu Ala Thr 
1160 U65 n7o 

Phe lie Glu Glu Leu Glu Ala Val Glu Ala Lys Glu Lys Gin Asp 
H75 u8o 1185 

GlU ^ an Gly L6U Pr ° Gly LyS Gly Gly Lys *** Lys Gly Lys 

1190 1195 1200 

Lys Thr Gin Met Ala Glu Val Leu Pro Ser Pro Arg Gly Gin Arg 
1205 1210 1215 

Val Pr ° ^ Ile Tnr Ile Glu Met L * 3 Ma Glu Ala Glu Lys 

1220 1225 1230 



Lys Asn Lys Lys Lys lie Lys Asn Glu Asn Thr Glu Gly Ser Pro 
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1235 1240 1245 

Gin Gin Asp Gly Val Glu Leu Glu Gly Leu Lys Gin Arg Leu Glu 
1250 1255 1260 

Lys Lys Gin Lys Arg Glu Pro Gly .Thr.Lys Thr Lys Lys Gin Thr 
1265 1270 1275 

Thr Leu Ala Phe Lys Pro He Lys Lys Gly Lys Lys Arg Asn Pro 
1280 1285 " ' 1290 

Trp Ser Asp Ser Glu Ser Asp Arg Ser Ser Asp Glu Ser Asn Phe 
1295 1300 1305 

Asp Val Pro Pro Arg Glu Thr Glu Pro Arg Arg Ala Ala Thr Lys 
1310 1315 1320 

Thr Lys Phe Thr Met Asp Leu Asp Ser Asp Glu Asp Phe Ser Asp 
1325 1330 1335 

Phe Asp Glu Lys Thr Asp Asp Glu Asp Phe Val Pro Ser Asp Ala 
1340 1345 1350 



Ser Pro Pro Lys Thr Lys Thr Ser Pro Lys Leu Ser Asn Lys Glu 
1355 1360 1365 



Leu Lys Pro Gin Lys Ser Val Val Ser Asp Leu Glu Ala Asp Asp 
1370 1375 1380 



Val Lys Gly Ser Val Pro Leu Ser Ser Ser Pro Pro Ala Thr His 
1385 1390 1395 



Phe Pro Asp Glu Thr Glu He Thr Asn Pro Val Pro Lys Lys Asn 
1400 1405 1410 



Val Thr Val Lys Lys Thr Ala Ala Lys Ser Gin Ser Ser Thr Ser 
1415 1420 1425 



Thr Thr Gly Ala Lys Lys Arg Ala Ala Pro Lys Gly Thr Lys Arg 
1430 1435 1440 



Asp Pro Ala Leu Asn Ser Gly Val Ser Gin Lys Pro Asp Pro Ala 
1445 1450 1455 



Lys Thr Lys Asn Arg Arg Lys Arg Lys Pro Ser Thr Ser Asp Asp 
1460 1465 1470 



Ser Asp Ser Asn Phe Glu Lys He Val Ser Lys Ala Val Thr Ser 
1475 1480 1485 
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Lys Lys Ser Lys Gly Glu Ser Asp Asp Phe His Met Asp Phe Asp 
1490 1495 1500 



Ser Ala Val Ala Pro Arg Ala Lys Ser Val Arg Ala Lys Lys Pro 
1505 1510 1515 



He Lys Tyr Leu. Glu Glu Ser. Asp Glu Asp Asp Leu Phe 
1520 1525 1530 



<210> 31 

<211> 1369 

<212> DNA 

<213> NMJL38410 chemokine-like factor super family 7, CKLFSF7 



<220> 

<221> CDS 

<222> {237) . . (761) 

<223> 



<400> 31 

agtctaactt ccgcgcatct acgagggccg ggactgccgc tacctttctg gaaggcgccg 60 

gccggccagt caccggaggg atcccgccag aggcggctcc gcgtctccag cagcggggca 120 

gggacccggg cgccccgccc tcgccagcgc ccgcccccgc ccgcccccgg cccgccctct 180 

gtatctggcc cctgggcagc tgcccgggga ggcggccagc gagctggggc cgcgca atg 239 

Met 
I 

teg cac gga gec ggg etc gtc cgc acc acg tgc age age ggc age gcg 287 
Ser His Gly Ala Gly Leu Val Arg Thr Thr Cys Ser Ser Gly Ser Ala 
5 10 15 

etc gga ccc ggg gec ggc gcg gec cag ccc age gcg age ccc ttg gag 335 
Leu Gly Pro Gly Ala Gly Ala Ala Gin Pro Ser Ala Ser Pro Leu Glu 
20 25 30 

ggg ctg ctg gac etc age tac ccc cgc acc cac gcg gec ctg ctg aaa 383 
Gly Leu Leu Asp Leu Ser Tyr Pro Arg Thr His Ala Ala Leu Leu Lys 
35 40 45 

gtg gcg caa atg gtc acc ctg ctg att gec ttc ate tgt gtg egg age 431 
Val Ala Gin Met Val Thr Leu Leu He Ala Phe He Cys Val Arg Ser 
50 55 60 . 65 

tec ctg tgg acc aac tac age gec tac age tac ttt gaa gtg gtc acc 479 
Ser Leu Trp Thr Asn Tyr Ser Ala Tyr Ser Tyr Phe Glu Val Val Thr 
70 75 80 

att tgc gac ttg ata atg ate etc gec ttt tac ctg gtc cac etc ttc 527 
He Cys Asp Leu He Met He Leu Ala Phe Tyr Leu Val His Leu Phe 
85 90 95 

cgc ttc tac cgc gtg etc acc tgt ate age tgg ccc ctg teg gaa ctt 575 
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Arg Phe Tyr Arg Val Leu Thr Cys He Ser Trp 
100 ■ 105 

ctg cac tat tta ate ggt acc ctg etc etc etc 
Leu His Tyr Leu He Gly Thr Leu Leu Leu Leu 
115 120 

gca get tec aag agt tac aae cag age gga ctg 
Ala Ala Ser Lys Ser Tyr Asn Gin Ser Gly Leu 
130 135 140 

ttt ggt ttc atg gee acc ttc etc tgc atg gca 
Phe Gly Phe Met Ala Thr Phe Leu Cys Met Ala 

.150 155 

tat aag ate teg tgt gta acc cag tec aca gat 
Tyr Lys He Ser Cys Val Thr Gin Ser Thr Asp 
165 170 

tgatgaggee acaaccccta ggcccctcag gagctttgea 

aggegaggee tctggacctg tgttcctgtg ccaaagtcct 

gaaaggcctg caccctcttc ctgctctccc aggaagecag 

geeggaaact cttcctccag ccttccgggg agaacatccc 

gcagcctcca gggaaatgtt ttctgccttc ctgettctag 

aaccccactt agcacagctg aaggggtttg tgaatactcc 

cactcctcag gggagtgaag tgecttaaga aacaaagccc 

gtcagtcegg tcttagagat accctctttc ctgaagtgag 

tgtggtcagc ctgtccccaa ggagatcttg tgtctcctct 

agtgtgcatg tgtttgtgtg ttttttaata aaatattgac 

aaaaaaaa 



Pro Leu Ser Glu Leu 
110 

ate gec tec att gtg 623 
He Ala Ser He Val 
125 

gta gec gga gcg ate 671 
Val Ala Gly Ala He 
145 

age ata tgg ctg tec 719 
Ser He Trp Leu Ser 
160 

gca gee gtc 761 
Ala Ala Val 
175 

gagaggagga cgtgtactcc 821 
gtcaggctgg tgggcaccag 881 
ctccctgagc tcctgagcca 941 
tcccattctg ggaaaggaaa 1001 
aaccacctca ggtactgatg 1061 
cgcctaaatc ccttctactt 1121 
tgtcctaatt tatctagctt 1181 
gcgtgcctgt agaaacacta ' 1241 
ccatctctgc ctttgttacc 1301 
teggecagtt aaaaaaaaaa 1361 

1369 



<210> 


32 


<211> 


175 


<212> 


PRT 


<213> 


NM 



<400> 32 

Met Ser His Gly Ala Gly Leu Val Arg Thr Thr Cys Ser Ser Gly Ser 
1 5 10 15 

Ala Leu Gly Pro Gly Ala Gly Ala Ala Gin Pro Ser Ala Ser Pro Leu 
20 25 30 

Glu Gly Leu Leu Asp Leu Ser Tyr Pro Arg Thr His Ala Ala Leu Leu 
35 40 45 



Lys Val Ala Gin Met Val Thr Leu Leu He Ala Phe He Cys Val Arq 
50 55 60 
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Ser Ser Leu Trp Thr Asn Tyr Ser Ala Tyr Ser Tyr Phe Glu Val Val 
65 70 .75 . 80 



Thr He Cys Asp Leu He Met He Leu Ala Phe Tyr Leu Val His Leu 
85 90 95 



Phe Arg Phe Tyr Arg Val Leu Thr Cys -He Ser Trp Pro Leu Ser Glu 
100 105 110 



Leu Leu His Tyr Leu He Gly Thr Leu Leu Leu Leu He Ala Ser He 
115 120 125 



Val Ala Ala Ser Lys Ser Tyr Asn Gin Ser Gly Leu Val Ala Gly Ala 
130 135 140 



He Phe Gly Phe Met Ala Thr Phe Leu Cys Met Ala Ser He Trp Leu 
145 150 155 160 



Ser Tyr Lys He Ser Cys Val Thr Gin Ser Thr Asp Ala Ala Val 
165 170 175 

<210> 33 
<211> 3470 
<212> DNA 

<213> NM_012112 C20orfl / chromosome' 20 open reading frame 1 



<220> 

<221> CDS 

<222> (458) . . (2698) 

<223> 



<400> 33 
ggctaggtga 


gccgtgggaa gaaaagaggg 


agcagctagg 


gegegggtet 


ccctcctccc 


60 


ggagtttgga 


acggctgaag ttcaccttcc 


agcccctagc 


gccgttcgcg 


ccgctaggcc 


120 


tggcttctga 


ggcggttgcg gtgctcggtc 


gccgcctagg 


eggggcaggg 


tgegagcagg 


180 


ggcttcgggc 


cacgcttctc ttggcgacag 


gattttgctg 


tgaagtccgt 


ccgggaaacg 


240 


gaggaaaaaa 


agagttgcgg gaggctgtcg 


gctaataacg 


gttcttgata 


catatttgee 


300 


agacttcaag 


atttcagaaa aggggtgaaa 


gagaagattg 


caactttgag 


tcagacctgt 


360 


aggcctgata 


gactgattaa accacagaag 


gtgacctgct 


gagaaaagtg 


gtacaaatac 


420 


tgggaaaaac 


ctgctcttct gcgttaagtg 


ggagaca atg tea caa gtt aaa age 


475 



Met Ser Gin Val Lys Ser 
1 5 
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tct tat tec tat gat gec ccc teg gat ttc ate aat ttt tea tec ttg 523 
Ser Tyr Ser Tyr Asp Ala Pro Ser Asp Phe He Asn Phe Ser Ser Leu 
10 15 20 

gat gat gaa gga gat act caa aac ata gat tea tgg ttt gag gag aag 571 
Asp Asp Glu Gly Asp Thr Gin Asn He Asp Ser Trp Phe Glu Glu Lys 
25 30 35 

gee aat ttg gag aat aag tta ctg ggg aag aat gga act gga ggg ctt 619 
Ala Asn Leu Glu Asn Lys Leu Leu Gly Lys Asn Gly Thr Gly Gly Leu 
40 .45 . . 50 

ttt cag ggc aaa act cct ttg aga aag get aat ctt cag caa get att 667 
Phe Gin Gly Lys Thr Pro Leu Arg Lys Ala Asn Leu Gin Gin Ala He 
55 60 ' 65 70 

gtc aca cct ttg aaa cca gtt gac aac act tac tac aaa gag gca gaa 715 
Val Thr Pro Leu Lys Pro Val Asp Asn Thr Tyr . Tyr Lys Glu Ala Glu 
75 80 85 

aaa gaa aat ctt gtg gaa caa tec att ccg tea aat get tgt tct tec 763 
Lys Glu Asn Leu Val Glu Gin Ser He Pro Ser Asn Ala Cys Ser Ser 
90 95 100 

ctg gaa gtt gag gca gee ata tea aga aaa act cca gee cag cct cag 811 
Leu Glu Val Glu Ala Ala He Ser Arg Lys Thr Pro Ala Gin Pro Gin 
105 110 115 

aga aga tct ctt agg ctt tct get cag aag gat ttg gaa cag aaa gaa 859 
Arg Arg Ser Leu Arg Leu Ser Ala Gin Lys Asp Leu Glu Gin Lys Glu 
120 125 130 

aag cat cat gta aaa atg aaa gec aag aga tgt gee act cct gta ate 907 
Lys His His Val Lys Met Lys Ala Lys Arg Cys Ala Thr Pro Val He 
135 140 145 150 

ate gat gaa att eta ccc tct aag aaa atg aaa gtt tct aac aac aaa 955 
He Asp Glu He Leu Pro Ser Lys Lys Met Lys Val Ser Asn Asn Lys 
155 160 165 

aag aag cca gag gaa gaa ggc agt get cat caa gat act get gaa aag 1003 
Lys Lys Pro Glu Glu Glu Gly Ser Ala His Gin Asp Thr Ala Glu Lys 
170 175 180 

aat gca tct tec cca gag aaa gee aag ggt aga cat act gtg cct tgt 1051 
Asn Ala Ser Ser Pro Glu Lys Ala Lys Gly Arg His Thr Val Pro Cys 
185 190 195 

atg cca cct gca aag cag aag ttt eta aaa agt act gag gag caa gag 1099 
Met Pro Pro Ala Lys Gin Lys Phe Leu Lys Ser Thr Glu Glu Gin Glu 
200 205 210 

ctg gag aag agt atg aaa atg cag caa gag gtg gtg gag atg egg aaa 1147 
Leu Glu Lys Ser Met Lys Met Gin Gin Glu Val Val Glu Met Arg Lys 
215 220 225 230 

aag aat gaa gaa ttc aag aaa ctt get ctg get gga ata ggg caa cct 1195 
Lys Asn Glu Glu Phe Lys Lys Leu Ala Leu Ala Gly He Gly Gin Pro 
235 240 245 

gtg aag aaa tea gtg age cag gtc acc aaa tea gtt gac ttc cac ttc 1243 
Val Lys Lys Ser Val Ser Gin Val Thr Lys Ser Val Asp Phe His Phe 
250 255 260 

cgc aca gat gag cga ate aaa caa cat cct aag aac cag gag gaa tat 1291 
Arg Thr Asp Glu Arg He Lys Gin His Pro Ly3 Asn Gin Glu Glu Tyr 
265 270 275 
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aag gaa gtg aac ttt aca tct gaa eta cga aag cat cct tea tct cct 1339 
Lys Glu Val Asn Phe Thr Ser Glu Leu Arg Lys His Pro Ser Ser Pro 
280 285 290 

gee cga gtg act aag gga tgt acc att gtt aag cct ttc aac ctg tec 1387 
Ala Arg Val Thr Lys Gly Cys Thr He Val Lys Pro Phe Asn Leu Ser 
295 300 305 310 

caa gga aag aaa aga aca ttt gat gaa aca gtt tct aca tat gtg ccc 1435 
Gin Gly Lys Lys Arg Thr Phe Asp Glu Thr Val Ser Thr Tyr Val Pro 
315 320 325 

ctt gca cag caa gtt gaa gac ttc cat aaa cga acc cct aac aga tat 1483 
Leu Ala Gin Gin Val Glu Asp Phe His Lys Arg Thr Pro Asn Arg Tyr 
330 335 .340 

cat ttg agg age aag aag gat gat att aac ctg tta ccc tec aaa tct 1531 
His Leu Arg Ser Lys Lys Asp Asp He Asn Leu Leu Pro Ser Lys Ser 
345 350 355 

tct gtg acc aag att tgc aga gac cca cag act cct gta ctg caa acc 1579 
Ser Val Thr Lys He Cys Arg Asp Pro Gin Thr Pro Val Leu Gin Thr 
360 365 370 

aaa cac cgt gca egg get gtg acc tgc aaa agt aca gca gag ctg gag 1627 
Lys His Arg Ala Arg Ala Val Thr Cys Lys Ser Thr Ala Glu Leu Glu 
375 380 . 385 390 

get gag gag etc gag aaa ttg caa caa tac aaa ttc aaa gca cgt gaa 1675 
Ala Glu Glu Leu Glu Lys Leu Gin Gin Tyr Lys Phe Lys Ala Arg Glu 
395 400 405 

ctt gat ccc aga ata ctt gaa ggt ggg ccc ate ttg ccc aag aaa cca 1723 
Leu Asp Pro Arg He Leu Glu Gly Gly Pro He Leu Pro Lys Lys Pro 
410 415 420 

cct gtg aaa cca ccc acc gag cct att ggc ttt gat ttg gaa att gag 1771 
Pro Val Lys Pro Pro Thr Glu Pro He Gly Phe Asp Leu Glu He Glu 
425 430 435 

aaa aga ate cag gag cga gaa tea aag aag aaa aca gag gat gaa cac 1819 
Lys Arg He Gin Glu Arg Glu Ser Lys Lys Lys Thr Glu Asp Glu His 
440 445 450 

ttt gaa ttt cat tec aga cct tgc cct act aag att ttg gaa gat gtt 1867 
Phe Glu Phe His Ser Arg Pro Cys Pro Thr Lys He Leu Glu Asp Val 
455 460 465 470 

gtg ggt gtt cct gaa aag aag gta ctt cca. ate acc gtc ccc aag tea 1915 
Val Gly Val Pro Glu Lys Lys Val Leu Pro lie Thr Val Pro Lys Ser 
475 480 485 

cca gee ttt gca ttg aag aac aga att cga atg ccc acc aaa gaa gat 1963 
Pro Ala Phe Ala Leu Lys Asn Arg He Arg Met Pro Thr Lys Glu Asp 
490 495 500 

gag gaa gag gac gaa ccg gta gtg ata aaa get caa cct gtg cca cat 2011 
Glu Glu Glu Asp Glu Pro Val Val lie Lys Ala Gin Pro Val Pro His 
505 510 515 

tat ggg gtg cct ttt aag ccc caa ate cca gag gca aga act gtg gaa 2059 
Tyr Gly Val Pro Phe Lys Pro Gin lie Pro Glu Ala Arg Thr Val Glu 
520 525 530 

ata tgc cct ttc teg ttt gat tct cga gac aaa gaa cgt cag tta cag 2107 
lie Cys Pro Phe Ser Phe Asp Ser Arg Asp Lys Glu Arg Gin Leu Gin 
535 540 545 550 
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aag gag aag aaa ata aaa gaa ctg cag aaa ggg gag gtg ccc aag ttc 2155 
Lys Glu Lys Lys lie Lys Glu Leu Gin Lys Gly Glu Val Pro Lys Phe 
555 560 565 

aag gca ctt ccc ttg cct cat ttt gac acc att aac ctg cca gag aag 2203 
Lys Ala Leu Pro Leu Pro His Phe Asp Thr lie Asn Leu Pro Glu Lys 
570 575 580 

aag gta aag aat gtg acc cag att gaa cct ttc tgc ttg gag act gac 2251 
Lys Val Lys Asn Val Thr Gin lie Glu Pro Phe Cys Leu Glu Thr Asp 
585 590 595 

aga aga ggt get ctg aag gca cag act tgg aag cac cag ctg gaa gaa 229 r 9 
Arg Arg Gly Ala Leu Lys Ala Gin Thr Trp Lys His Gin Leu Glu Glu 
600 605 610 

gaa ctg aga cag cag aaa gaa gca get tgt ttc aag get cgt cca aac 2347 
Glu Leu Arg Gin Gin Lys Glu Ala Ala Cys Phe Lys Ala Arg Pro Asn 
615 620 625 630 

acc gtc ate tct cag gag ccc ttt gtt ccc aag aaa gag aag aaa tea 2395 
Thr Val He Ser Gin Glu Pro Phe Val Pro Lys Lys Glu Lys Lys Ser 
635 640 645 

gtt get gag ggc ctt tct ggt tct eta gtt cag gaa cct ttt cag ctg 2443 
Val Ala Glu Gly Leu Ser Gly Ser Leu Val Gin Glu Pro Phe Gin Leu 
650 655 . 660 

get act gag aag aga gee aaa gag egg cag gag ctg gag aag aga atg 2491 
Ala Thr Glu Lys Arg Ala Lys Glu Arg Gin Glu Leu Glu Lys Arg Met 
665 670 675 

get gag gta gaa gee cag aaa gee cag cag ttg gag gag gee aga eta 2539 
Ala Glu Val Glu Ala Gin Lys Ala Gin Gin Leu Glu Glu Ala Arg Leu 
680 685 690 

cag gag gaa gag cag aaa aaa gag gag ctg gee agg eta egg aga gaa 2587 
Gin Glu Glu Glu Gin Lys Lys Glu Glu Leu Ala Arg Leu Arg Arg Glu 
695 . 700 705 710 

ctg gtg cat aag gca aat cca ata cgc aag tac cag ggt ctg gag ata 2635 
.Leu Val His Lys Ala Asn Pro He Arg Lys Tyr Gin Gly Leu Glu He 
715 720 725 

aag tea agt gac cag cct ctg act gtg cct gta tct ccc aaa ttc tec 2683 
Lys Ser Ser Asp Gin Pro Leu Thr Val Pro Val Ser Pro Lys Phe Ser 
730 735 740 

act cga ttc cac tgc taaactcagc tgtgagctgc ggataccgcc cggcaatggg 2738 
Thr Arg Phe His Cys 
745 



acctgctctt 


aacctcaaac 


etaggacegt 


ettgetttgt 


cattgggcat 


ggagagaacc 


2798 


catttctcca 


gacttttacc 


tacccgtgcc 


tgagaaagca 


tacttgacaa 


ctgtggactc 


2858 


cagttttgtt 


gagaattgtt 


ttcttacatt 


actaaggcta 


ataatgagat 


gtaactcatg 


2918 


aatgtctcga 


ttagactcca 


tgtagttact 


tcctttaaac 


catcagccgg 


ccttttatat 


2978 


gggtcttcac 


tctgactaga 


atttagtctc 


tgtgtcagca 


cagtgtaatc 


tetattgeta 


3038 


ttgcccctta 


cgactctcac 


cctctcccca 


ctttttttaa 


aaattttaac 


cagaaaataa 


3098 


agatagttaa 


atcctaagat 


agagattaag 


tcatggttta 


aatgaggaac 


aatcagtaaa 


3158 


tcagattctg 


tcctcttctc 


tgcataccgt 


gaatttatag 


ttaaggatcc 


ctttgctgtg 


3218 
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agggtagaaa acctcaccaa ctgcaccagt gaggaagaag actgcgtgga ttcatgggga 3278 

gcctcacagc agccacgcag caggctctgg gtggggctgc cgttaaggca cgttctttcc 3338 

ttactggtgc tgataacaac agggaaccgt gcagtgtgca ttttaagacc tggcctggaa 3398 

taaatacgtt ttgtctttcc ctcaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3458 

aaaaaaaaaa aa 3470 

<210> 34 

<211> 747 

<212> PRT 

<213> KM_012112 C20orfl, chromosome 20 open reading frame 1 

<400> 34 

Met Ser Gin Val Lys Ser Ser Tyr Ser Tyr Asp Ala Pro Ser Asp Phe 
1 5 10 15 



lie Asn Phe Ser Ser Leu Asp Asp Glu Gly Asp Thr Gin Asn lie Asp 
20 25 30 



Ser Trp Phe Glu Glu Lys Ala Asn Leu Glu Asn Lys "Leu Leu Gly Lys 
35 40 45 



Asn Gly Thr Gly Gly Leu Phe Gin Gly Lys Thr Pro Leu Arg Lys Ala 
50 55 60 



Asn Leu Gin Gin Ala lie Val Thr Pro Leu Lys Pro Val Asp Asn Thr 
65 70 75 . 80 



Tyr Tyr Lys Glu Ala Glu Lys Glu Asn Leu Val Glu Gin Ser lie Pro 
85 90 95 



Ser Asn Ala Cys Ser Ser Leu Glu Val Glu Ala Ala lie Ser Arg Lys 
100 105 : 110 



Thr Pro Ala Gin Pro Gin Arg Arg Ser Leu Arg Leu Ser Ala Gin Lys 
115 120 125 



Asp Leu Glu Gin Lys Glu Lys His His Val Lys Met Lys Ala Lys Arg 
130 135 * 140 



Cys Ala Thr Pro Val He He Asp Glu He Leu Pro Ser Lys Lys Met 
145 150 155 160 



Lys Val Ser Asn Asn Lys Lys Lys Pro Glu Glu Glu Gly Ser Ala His 
165 170 175 
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Gin Asp Thr Ala Glu Lys Asn Ala Ser Ser Pro Glu Lys Ala Lys Gly 
180 185 190 

Arg His Thr Val Pro Cys Met Pro Pro Ala Lys Gin Lys Phe Leu Lys 
195 200 205 

Ser Thr Glu Glu Gin Glu Leu Glu Lys Ser Met Lys Met Gin Gin Glu 
210 215 220 

Val Val Glu Met Arg Lys Lys Asn Glu Glu Phe Lys Lys Leu Ala Leu 
225 230 235 240 

Ala Gly He Gly Gin Pro Val Lys Lys Ser Val Ser Gin Val Thr Lys 
245 250 255 

Ser Val Asp Phe His Phe Arg Thr Asp Glu Arg He Lys Gin His Pro 
260 265 270 

Lys Asn Gin Glu Glu Tyr Lys Glu Val Asn Phe' Thr Ser Glu Leu Arg 
275 280 285 

Lys His Pro Ser Ser Pro Ala Arg Val Thr Lys Gly Cys Thr He Val 
290 295 300 

Lys Pro Phe Asn Leu Ser Gin Gly Lys Lys Arg Thr Phe Asp Glu Thr 
305 310 315 320 

Val Ser Thr Tyr Val Pro Leu Ala Gin Gin Val Glu Asp Phe His Lys 
325 330' 335 

Arg Thr Pro Asn Arg Tyr His Leu Arg Ser Lys Lys Asp Asp He Asn 
340 345 350 

Leu Leu Pro Ser Lys Ser Ser Val Thr Lys He Cys Arg Asp Pro Gin 
355 360 365 

Thr Pro Val Leu Gin Thr Lys His Arg Ala Arg Ala Val Thr Cys Lys 
370 375 380 

Ser Thr Ala Glu Leu Glu Ala Glu Glu Leu Glu Lys Leu Gin Gin Tyr 
385 390 395 400 

Lys Phe Lys Ala Arg Glu Leu Asp Pro Arg He Leu Glu Gly Gly Pro 
405 410 415 

He Leu Pro Lys Lys Pro Pro Val Lys Pro Pro Thr Glu Pro He Gly 
420 425 430 



Phe Asp Leu Glu He Glu Lys Arg He Gin Glu Arg Glu Ser Lys Lys 
435 440 445 
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Lys Thr Glu Asp Glu His Phe Glu Phe His Ser Arg Pro Cys Pro Thr 
450 455 460 



Lys lie Leu Glu Asp Val Val Gly Val Pro Glu Lys Lys Val Leu Pro 
465 470 475 480 



lie Thr Val Pro Lys Ser Pro Ala Phe Ala Leu Lys Asn Arg lie Arg 
485 490 * 495 



Met Pro Thr Lys Glu Asp Glu Glu Glu Asp Glu Pro Val Val lie Lys 
500 505 510 



Ala Gin Pro Val Pro His Tyr Gly Val Pro Phe Lys Pro Gin lie Pro 
515 520 525 



Glu Ala Arg Thr Val Glu He Cys Pro Phe Ser Phe Asp Ser Arg Asp 
530 535 540 



Lys Glu Arg Gin Leu Gin Lys Glu Lys Lys He Lys Glu Leu Gin Lys 

545 550 555 560 

Gly Glu Val Pro Lys Phe Lys Ala Leu Pro Leu Pro His Phe Asp Thr 

565 570 575 



He Asn Leu Pro Glu Lys Lys Val Lys .Asn Val Thr Gin He Glu Pro 
580 585 590 

Phe Cys Leu Glu Thr Asp Arg Arg Gly Ala Leu Lys Ala Gin Thr Trp 
595 600 605 



Lys His Gin Leu Glu Glu Glu Leu Arg Gin Gin Lys Glu Ala Ala Cys 
fi 10 615 620 



Phe Lys Ala Arg Pro Asn Thr Val He Ser Gin Glu Pro Phe Val Pro 
625 630 635 640 

Lys Lys Glu Lys Lys Ser Val Ala Glu Gly Leu Ser Gly Ser Leu Val 
645 650 655 

Gin Glu Pro Phe Gin Leu Ala Thr Glu Lys Arg Ala Lys Glu Arg Gin 

660 665 670 



Glu Leu Glu Lys Arg Met Ala Glu Val Glu Ala Gin Lys Ala Gin Gin 
675 680 685 



Leu Glu Glu Ala Arg Leu Gin Glu Glu Glu Gin Lys Lys Glu Glu Leu 
690 695 700 



Ala Arg Leu Arg Arg Glu Leu Val His Lys Ala Asn Pro He Arg Lys 
705 710 715 720 
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Tyr Gin Gly Leu Glu He Lys Ser Ser Asp Gin Pro Leu Thr Val Pro 
725 730 735 



Val Ser Pro Lys Phe Ser Thr Arg Phe His Cys 
740 745 



<210> 35 

<211> 1246 

<212> DNA 

<213> NM_001769 CD9 

<220> 

<221> CDS 

<222> (112) . . (795) 

<223> 



<400> 35 

gaccagccta cagccgcctg catctgtatc cagcgccagg tcccgccagt cccagctgcg 60 

cgcgcccccc agtcccgcac ccgttcggcc caggctaagt tagccctcac c atg ccg 117 

Met Pro 
1 

gtc aaa gga ggc acc aag tgc ate aaa tac ctg ctg ttc gga ttt aac 165 
Val Lys Gly Gly Thr Lys Cys He Lys Tyr Leu Leu Phe Gly Phe Asn 
5 10 15 

ttc ate ttc tgg ctt gec ggg att get gtc ctt gee att gga eta tgg 213 
Phe lie Phe Trp Leu Ala Gly He Ala Val Leu Ala He Gly Leu Trp 
20 25 30 

etc cga ttc gac tct cag acc aag age ate ttc gag caa gaa act aat 261 
Leu Arg Phe Asp Ser Gin Thr Lys Ser He Phe Glu Gin Glu Thr Asn 
35 40 45 50 

aat aat aat tec age ttc tac aca gga gtc tat att ctg ate gga gec 309 
Asn Asn Asn Ser Ser Phe Tyr Thr Gly Val Tyr He Leu He Gly Ala 
55 60 65 

ggc gee etc atg atg ctg gtg ggc ttc ctg ggc tgc tgc ggg get gtg 357 
Gly Ala Leu Met Met Leu Val Gly Phe Leu Gly Cys Cys Gly Ala Val 
70 75 80 

cag gag tec cag tgc atg ctg gga ctg ttc ttc ggc ttc etc ttg gtg 405 
Gin Glu Ser Gin Cys Met Leu Gly Leu Phe Phe Gly Phe Leu Leu Val 
85 90 95 

ata ttc gee att gaa ata get gcg gee ate tgg gga tat tec cac aag 453 
He Phe Ala He Glu He Ala Ala Ala He Trp Gly Tyr Ser His Lys 
100 105 HO 

gat gag gtg att aag gaa gtc cag gag ttt tac aag gac acc tac aac 501 
Asp Glu Val He Lys Glu Val Gin Glu Phe Tyr Lys Asp Thr Tyr Asn 
115 120 . 125 130 

aag ctg aaa acc aag gat gag ccc cag egg gaa acg ctg aaa gec ate 549 
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Lys Leu Lys Thr Lys Asp Glu Pro Gin Arg Glu Thr Leu Lys Ala He 
135 140 145 

cac tat gcg ttg aac tgc tgt ggt ttg get ggg ggc gtg gaa.cag ttt 597 
His Tyr Ala Leu Asn Cys Cys Gly Leu Ala Gly Gly Val Glu Gin Phe 
150 155 160 

ate tea gac ate tgc ccc aag aag gac gta etc gaa ace ttc ace gtg 645 
He Ser Asp He Cys Pro Lys Lys Asp Val Leu Glu Thr Phe Thr Val 
165 170 175 

aag tec tgt cct gat gee ate aaa gag gtc ttc gac aat aaa ttc cac 693 
Lys Ser Cys Pro Asp Ala He Lys Glu Val Phe Asp Asn Lys Phe His 
180 195 190 

ate ate ggc gca gtg ggc ate ggc att gee gtg gtc at g ata ttt ggc 741 
He He Gly Ala Val Gly He Gly He Ala Val Val Met He Phe Gly 
195 200 205 " 210 

atg ate ttc agt atg ate ttg tgc tgt get ate cgc agg aac cgc gag 789 
Met He Phe Ser Met He Leu Cys Cys Ala He Arg Arg Asn Arg Glu 
215 220 225 

Met Val tagagtcagC ttacatc cct gagcaggaaa gtttacccat gaagattggt 845 



gggatttttt gtttgtttgt tttgttttgt ttgttgtttg ttgtttgttt ttttgecact 905 

aattttagta ttcattctgc attgetagat aaaagctgaa gttactttat gtttgtcttt 965 

.taatgettea ttcaatattg acatttgtag ttgagcgggg ggtttggttt gctttggttt 1025 

atattttttc agttgtttgt ttttgcttgt tatattaagc agaaatcctg caatgaaagg 1085 

tactatattt gctagactct agacaagata ttgtacataa aagaattttt ttgtctttaa 1145 

atagatacaa atgtctatca actttaatca agttgtaact tatattgaag acaatttgat 1205 

acataataaa aaattatgac aatgtcaaaa aaaaaaaaaa a 1246 



<210> 


36 


<211> 


228 


<212> 


PRT 


<213> 


NMJ 


<400> 


36 



Met Pro Val Lys Gly Gly Thr Lys Cys He Lys Tyr Leu Leu Phe Gly 
1 5 10 u 



15 



Phe Asn Phe He Phe Trp Leu Ala Gly He Ala Val Leu Ala He Gly 
20 25 30 



Leu Trp Leu Arg Phe Asp Ser Gin Thr Lys Ser He Phe Glu Gin Glu 
35 40 45 



Thr Asn Asn Asn Asn Ser Ser Phe Tyr Thr Gly Val Tyr He Leu He 
50 55 60 



n 
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Gly Ala Gly Ala Leu Met Met Leu Val Gly Phe Leu Gly Cys Cys Gly 
65 70 75 80 



Ala Val Gin Glu Ser Gin Cys Met Leu Gly Leu Phe Phe Gly Phe Leu 
85 90 95 



Leu Val He Phe Ala He Glu He Ala Ala Ala He Trp Gly Tyr Ser 
100 105 110 



His Lys Asp Glu Val He Lys Glu Val Gin Glu Phe Tyr Lys Asp Thr 
115 120 125 



Tyr Asn Lys Leu Lys Thr Lys Asp Glu Pro Gin Arg Glu Thr Leu Lys 
130 135 . 140 



Ala lie His Tyr Ala Leu Asn Cys Cys Gly Leu Ala Gly Gly Val Glu 
145 150 155 160 



Gin Phe He Ser Asp He Cys Pro Lys Lys Asp Val Leu Glu Thr Phe 
165 170 ' 175 



Thr Val Lys Ser Cys Pro Asp Ala He Lys Glu Val Phe Asp Asn Lys 
180 185 190 



Phe His He He Gly Ala Val Gly He Gly He Ala Val Val Met He 
195 200 205 



Phe Gly Met He Phe Ser Met He Leu Cys Cys Ala He Arg Arg Asn 
210 215 220 



Arg Glu Met Val 
225 



<210> 37 

<211> 7557 

<212> DNA 

<213> NM_020859 ShrmL, Shroom-related protein 



<220> 

<221> CDS 

<222> (958) (6942) 

<223> 



<400> 37 

ggttattaac tgtaagactg tttgaattgt caggtctttg tggtgagctg aattctgcat 60 
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ggcagtgctt 


actgtgcaga 


aaggttgcaa 


tgcctcactt tgccttaatc 


catgttcctc 


120 


taacttcctt 


actctttctc 


tcaagaggag 


gcaagtggct gtggcggccg 


cagcagtggc 


180 


tgatcatcac 


tgaaaatacc 


aaagaaaaga 


actgagctgc ctccttcata 


ttttttccat 


240 


tgaggattaa 


tttaccgtgc 


tttttcattt 


tctctacatc ctgcaaaagt 


ttttttctct 


300 


cctaagaaac 


aaactatgaa 


ctgattgttg 


aaaaaaagaa gtaaaaagtt 


ttagcacagc 


360 


ttctctgtct 


cttcgggaca 


agttagaaaa 


ttctgaagtg agccgaagca 


tagtaagtgc 


420 


tttctttctt 


tttaagctac 


ttctggggag 


ggaggaggct attgtaatgg 


taaatttcac 


480 


tccgagagga 


agaaagggtt 


gataatcaat 


caaaaatgag gtattccttt 


gagtatttgt 


540 


gattttctta 


ctatattgta 


agatgctttt 


aattttctct gtaaaatagg 


cagaaatggt 


600 


tttagtgtgt 


gtatgtgtga 


aataaaagct 


cagaaaagca atottcagag 


cgccactgaa 


660 


ggaagttttg 


acgaacggag 


tagagatgta 


taccacttgg gggcttcagt 


gagaacccag 


720 


aattcctgga 


ggaggattta 


cattcagaaa 


tgttgaagtg aaaattcctt 


ctggttcagc 


780 


atcttggagt 


tcagcttgga 


agaacatttt 


acgtatggaa gaatttgctt 


ctccaaacct 


840 


ctcttttggc 


cattgtgtgt 


cctgaaggat 


gggacaactt gtgctgtaga 


agcactgctt 


900 


gcctgagttt 


gcttcaggca 


ttttaaattt 


aacttgaggg atcatgtgtt 


tggcatg 


957 


atg agg acc act gaa gac ttc cac aag cct agt gcc aca tta aac tct 
Met Arg Thr Thr Glu Asp Phe His Lys Pro Ser Ala Thr Leu Asn Ser 


1005 



10 



15 



aac acg gcc acc aag gga agg tac att tat ctg gag gca ttc ctg gag 
Asn Xhr Ala Thr Lys Gly Arg Tyr He Tyr Leu Glu Ala Phe Leu Glu 
20 25" 30 



1053 



gga gga get ccc tgg ggt .ttt act eta aag ggt ggc ctg gag cac gga 
Gly Gly Ala Pro Trp Gly Phe Thr. Leu Lys Gly Gly Leu Glu His Gly 
35 40 45 



1101 



gaa cca tta ate ate tct aag gtc gaa gaa ggg ggc aaa gca gac acc 
Glu Pro Leu He He Ser Lys Val Glu Glu Gly Gly Lys Ala Asp Thr 
50 55 60 



1149 



ctg age tec aaa ctg cag get ggg gat gag gtt gtg cac ate aat gag 
Leu Ser Ser Lys Leu Gin Ala Gly Asp Glu Val Val. His He Asn Glu 
' 65 70 75 80 



1197 



gtg act ctg age age tec aga aag gag gca gtt tec ctg gtg aaa gga 
Val Thr Leu Ser Ser Ser Arg Lys Glu Ala Val Ser Leu Val Lys Gly 
85 90 95 



1245 



tec tac aag acc etc agg ctg gta gtg cgc aga gat gtg tgc aca gac 
Ser Tyr Lys Thr Leu Arg Leu Val Val Arg Arg Asp Val Cys Thr Asp 
100 105 110 



1293 



cca ggc cat gca gat act ggt gcc tct aac ttc gtc age cca gaa cac 
Pro Gly His Ala Asp Thr Gly Ala Ser Asn Phe Val Ser Pro Glu His 
115 120 125 



1341 



etc acc tct ggc ccc cag cac agg aaa gca gcg tgg tea gga ggg gtt 
Leu Thr Ser Gly Pro Gin His Arg Lys Ala Ala Trp Ser Gly Gly Val 
130 135 140 



1389 



aaa ctt egg ctg aag cac agg tct agt gag cct gca ggc cga cct cac 



1437 
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Lys Leu Arg Leu Lys His Arg Ser Ser Glu Pro Ala Gly Arg Pro His 
145 150 .155 160 

teg tgg cac aca act aaa tct ggg gag aag caa ccc gat gec age atg 1485 
Ser Trp His Thr Thr Lys Ser Gly Glu Lys Gin Pro Asp Ala Ser Met 
165 170' 175 

atg cag ata tct cag ggt atg ate ggc cct cct tgg cac caa age tac 1533 
Met Gin lie Ser Gin Gly Met lie Gly Pro Pro Trp His Gin Ser Tyr 
180 185 ~ 190 

cat tec age tec tct act agt gac etc tec aac tat gac cat get tat 1581 
His Ser Ser Ser Ser Thr Ser Asp Leu Ser Asn Tyr Asp His Ala Tyr 
195 200 205 

eta agg egg age cct gac cag tgc age tec cag ggg age atg gag age 1629 
Leu Arg Arg Ser Pro Asp Gin Cys Ser Ser Gin Gly Ser Met Glu Ser 
21 0 215 220 

ctg gag ccc agt ggg gca tac cca ccc tgt cat ctt tec cct gee aag 1677 
Leu Glu Pro Ser Gly Ala Tyr Pro Pro Cys His Leu Ser Pro Ala Lys 
225 230 235 240 

tec acc ggc age att gac cag etc age cac ttc cat aac aag aga gac 1725 
Ser Thr Gly Ser lie Asp Gin Leu Ser His Phe His Asn Lys Arg Asp 
245 250 255 

teg get tac age tct ttc tec acc agt "tct age ate eta gag tat cca 1773 
Ser Ala Tyr Ser Ser Phe Ser Thr Ser Ser Ser lie Leu Glu Tyr Pro 
260 265 270 

cac cct ggc ate tct gee egg gag cgt tea ggc tec atg gac aat act 1821 
His Pro Gly lie Ser Ala Arg Glu Arg Ser Gly Ser Met Asp Asn Thr 
275 280 285 

tct get cga ggt ggc etc etc gaa ggg atg agg cag gca gat att cgc 1869 
Ser Ala Arg Gly Gly Leu Leu Glu Gly Met Arg Gin Ala Asp lie Arg 
290 295 * 300 

tat gtc aag aca gtc tat gac acc egg agg gga gtc tea gca gag tat 1917 
Tyr Val Lys Thr Val Tyr Asp Thr Arg Arg Gly Val Ser Ala Glu Tyr 
305 310 315 320 

gag gtg aac tct tea gee ctg ctg ctt caa ggt agg gag gee cga gee 1965 
Glu Val Asn Ser Ser Ala Leu Leu Leu Gin Gly Arg Glu Ala Arg Ala 
325 330 335 

tea gca aat ggt cag ggc tat gat aaa tgg tct aat att cct egg ggc 2013 
Ser Ala Asn Gly Gin Gly Tyr Asp Lys Trp Ser Asn lie Pro Arg Gly 
340 345 350 

aag gga gtg cca ccc cca tec tgg age cag cag tgc ccc agt tec ttg 2061 
Lys Gly Val Pro Pro Pro Ser Trp Ser Gin Gin Cys Pro Ser Ser Leu 
355 360 365 

gag act gec acg gac aac ctt cct cct aag gtg ggt gca ccc ctg cct 2109 
Glu Thr Ala Thr Asp Asn Leu Pro Pro Lys Val Gly Ala Pro Leu Pro 
370 375 380 

cca get egg agt gac agt tac gca gca ttt egg cac cgt gag egg ccc 2157 
Pro Ala Arg Ser Asp Ser Tyr Ala Ala Phe Arg His Arg Glu Arg Pro 
385 390 395 400 

age tec tgg tct age ctt gat cag aaa egg etc tgc egg cct cag gca 2205 
Ser Ser Trp Ser Ser Leu Asp Gin Lys Arg Leu Cys Arg Pro Gin Ala 
405 410 415 

aac tct tta ggc tec ctg aag tct cca ttc ata gag gag cag ctg cat 2253 
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Asn Ser Leu Gly Ser Leu Lys Ser Pro Phe lie Glu Glu Gin Leu His 

/ion 



420 425 



430 



act gtg ctg gag aag agt cca gag aac age ccc cca gtg aag ccc aag 
Thr Val Leu Glu Lys Ser Pro Glu Asn Ser Pro Pro Val Lys Pro Lys 
435 .440 445 

cat aac tat acc cag aag gcc caa cct ggc caa cct ctg ctg ccg acc 
His Asn Tyr Thr Gin Lys Ala Gin Pro Gly Gin Pro Leu Leu Pro Thr 
450 455 4 60 

age ate tac gcg gta cct tec ctg gag cca cac ttt gcc cag gtg cct 2397 
Ser lie Tyr Ala Val Pro Ser Leu Glu Pro His Phe Ala Gin Val Pro 



465 470 



475 480 



ctg gag aga gaa ggc eta ggc cag age ctg tea ggc aac ttt ggc aag 

Leu Glu Arg Glu Gly Leu Gly Gin Ser Leu Ser Gly Asn Phe Gly Lys 
645 650 655 

acc aag tea gcc ttc tea tct etc cag aac att cct gag agt ctg aga 

Thr Lys Ser Ala Phe Ser Ser Leu Gin Asn lie Pro Glu Ser Leu Arg 
660 665 670 



2301 



2349 



cag cct tct gtg agt age aac ggt atg etc tac cct gca ctg gcc aag 2445 
Gin Pro Ser Val Ser Ser Asn Gly Met Leu Tyr Pro Ala Leu Ala Lys 
485 490 495 

gag agt gga tac ata gcc cct cag gga gca tgc aac aag atg get acc 2493 
Glu Ser Gly Tyr lie Ala Pro Gin Gly Ala Cys Asn Lys Met Ala Thr 
500 505 . 510 

att gat gag aat ggg aac cag aat gga tct ggc agg cct ggg ttt gcc 2541 
lie Asp Glu Asn Gly Asn Gin Asn Gly Ser Gly Arg Pro Gly Phe Ala 
515 520 525 

ttc tgc cag ccc tta gaa cat gac ttg ctg tec cca gtg gag aag aaa 2589 
Phe Cys Gin Pro Leu Glu His Asp Leu Leu Ser Pro Val Glu Lys Lys 
530 535 540 

cca gaa get aca gcc aag tat gtc ccc tec aaa gtc cat ttc tgt tea 2637 
Pro Glu Ala Thr Ala Lys Tyr Val Pro Ser Lys Val His Phe Cys Ser 
545 550 555 560 

gtg cct gaa aat gag gag gat gcc tec ctg aag aga cat etc aca cct 2685 
Val Pro Glu Asn Glu Glu Asp Ala Ser Leu Lys Arg His Leu Thr Pro 
565 570 575 

ccc caa ggc aac age cca cat tec aat gag aga aag age acc cac agt 2733 
Pro Gin Gly Asn Ser Pro His Ser Asn Glu Arg Lys Ser Thr His Ser 
580 585 -590 

aac aaa cca tct tct cat ccc cac age etc aaa tgc cct cag get edg 2781 
Asn Lys Pro Ser Ser His Pro His Ser Leu Lys Cys Pro Gin Ala Gin 
595 600 605 

gcc tgg caa gcg ggt gaa gac aag aga tct tec agg etc tea gag ccc 2829 
Ala Trp Gin Ala Gly Glu Asp Lys Arg Ser Ser Arg Leu Ser Glu Pro 
610 615 620 

tgg gag ggc gat ttc cag gaa gac cac aat gcc aac etc tgg agg agg 2877 
Trp Glu Gly Asp Phe Gin Glu Asp His Asn Ala Asn Leu Trp Arg Arg 
625 630 635 640 



2925 



2973 



aga cac age age ctg gag eta ggc egg gga acc cag gag ggt tac ccc 3021 
Arg His Ser Ser Leu Glu Leu Gly Arg Gly Thr Gin Glu Gly Tyr Pro 
675 680 685 

ggg ggc agg ccc acc tgt gca gtc aac acc aag gca gaa gac cct ggg 3069 
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Gly Gly Arg Pro Thr Cys Ala Val Asn Thr Lys Ala Glu Asp Pro Gly 
690 695 700 

agg aaa gcc get cct gac etc ggg age cat ctg gac egg cag gtt tec 3117 
Arg Lys Ala Ala Pro Asp Leu Gly Ser His Leu Asp Arg Gin Val Ser 
705 710 715 720 

tac ccg egg ccc gag ggg agg ace ggt gcc teg get tct ttc aac age 3165 
Tyr Pro Arg Pro Glu Gly Arg Thr Gly Ala Ser Ala Ser Phe Asn Ser 
725 730 735. 

aca gac cca agt ccc gaa gag ccg cct gcc ccc teg cac ccg cac aca 3213 
Thr Asp Pro Ser Pro Glu Glu Pro Pro Ala Pro Ser His Pro His Thr 
740 745 750 

tec agt ctg ggc egg agg ggg ccc ggc cca ggc age gcc teg get ctt 3261 
Ser Ser Leu Gly Arg Arg Gly Pro Gly Pro Gly Ser Ala Ser Ala Leu 
755 760 765 

cag ggc ttt cag tac ggg aag ccc cac tgc teg gtg ctg gag aag gtc 3309 
Gin Gly Phe Gin Tyr Gly Lys Pro His Cys Ser Val Leu Glu Lys Val 
770 775 780 

tec aaa ttc gag cag cga gag caa ggg age cag aga ccg agt gtg ggc 3357 
Ser Lys Phe Glu Gin Arg Glu Gin Gly Ser Gin Arg Pro Ser Val Gly 
785 790 795 800 

ggc tct ggt ttt ggc cat aac tat agg ccc cac agg acc gtc tea act 3405 
Gly Ser Gly Phe Gly His Asn Tyr Arg Pro His Arg Thr Val Ser Thr 
805 , 810 815 

tec agt act tct ggg aat gac ttc gag gag aca aaa gca cac att cgt 3453 
Ser Ser Thr Ser Gly Asn Asp Phe Glu Glu Thr Lys Ala His He Arg 
820 825 830 

ttc tct gag tea get gaa ccc eta ggc aac ggg gag cag cac ttc aaa 3501 
Phe Ser Glu Ser Ala Glu Pro Leu Gly Asn Gly Glu Gin His Phe Lys 
835 840 845 

aac ggg gag ctg aag ttg gaa gag get tec egg cag ccc tgc ggt cag 3549 
Asn Gly Glu Leu Lys Leu Glu Glu Ala Ser Arg Gin Pro Cys Gly Gin 
850 855 860 

cag ctg age gga gga gcg teg gac age ggc cgt ggc ccc cag agg ccg 3597 
Gin Leu Ser Gly Gly Ala Ser Asp Ser Gly Arg Gly Pro Gin Arg Pro 
865 870 875 880 

gac get egg etc etc cgt age cag age acc ttc cag etc tec age gag 3645 
Asp Ala Arg Leu Leu Arg Ser Gin Ser Thr Phe Gin Leu Ser Ser Glu 
885 '890 895 

cca gag agg gag ccc gag tgg egg gac agg ccc ggc teg ccc gaa teg 3693 
Pro Glu Arg Glu Pro Glu Trp Arg Asp Arg Pro Gly Ser Pro Glu Ser 
900 905 910 

ccc ctg ctg gat gcc ccc ttc age cgc gcc tac egg aac age ate aag 3741 
Pro Leu Leu Asp Ala Pro Phe Ser Arg Ala Tyr Arg Asn Ser He Lys 
915 920 925 

gac gca cag tec cgt gtc ttg ggg gcc acc tec ttt cga cgt cga gac 3789 
Asp Ala Gin Ser Arg Val Leu Gly Ala Thr Ser Phe Arg Arg Arg Asp 
930 935 940 

ctg gag ctg ggg gcg ccc gtg gcg teg agg tec tgg egg cca egg cct 3837 
Leu Glu Leu Gly Ala Pro Val Ala Ser Arg Ser Trp Arg Pro Arg Pro 
945 950 955 960 

tec teg gcc cac gtg ggg ctg egg age ccc gag gcg teg gcc tec gcc 3885 
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Ser Ser Ala His Val Gly Leu Arg Ser Pro Glu Ala Ser Ala Ser Ala 
965 970 . 975 



tec ccg cac acg ccc egg gag egg cac age gtg ace cct get gag ggc 
Ser Pro His Thr Pro Arg Glu Arg His Ser Val Thr Pro Ala Glu Gly 
980 985 990 



'3933 



gac ctg gee agg ccc gtg ccc cct gec gec egg aga ggt get cgc egg 
Asp Leu Ala Arg Pro Val Pro Pro Ala Ala Arg Arg Gly Ala Arg Arg 
995 1000 1005 



3981 



cgc ctg act ccc gag cag aag aag cgc tec tac teg gag ccc gag 
Arg Leu Thr Pro Glu Gin Lys Lys Arg Ser Tyr Ser Glu Pro Glu 
1010 1015 1020 



4026 



aag atg aac gag gtg ggg ate gtg gag gag gee gaa ccg gca ccc 
Lys Met Asn Glu Val Gly He Val Glu Glu Ala Glu Pro Ala Pro 
1025 1030 1035 



4071 



ctg ggc ccg cag aga aat ggg atg cgt ttc ccg gag age age gtg 
Leu Gly Pro Gin Arg Asn Gly Met Arg Phe Pro Glu Ser Ser Val 
1040 1045 1050 



4116 



gee gac egg cgc cgt etc. ttc 
Ala Asp Arg Arg Arg Leu Phe 
1055 1060 



gag cgc gat ggc aag gee tgc tec 
Glu Arg Asp Gly Lys Ala Cys Ser 
1065 



4161 



acg etc age ctg teg ggg ccc gag ctg aag cag ttc cag cag age 
Thr Leu Ser Leu Ser Gly Pro Glu Leu Lys Gin Phe Gin Gin Ser 
1070 1075 1080 



4206 



gec ctg gcg gac tac ate cag cgc aag ace ggc aag egg cct ace 
Ala Leu Ala Asp Tyr He Gin Arg Lys Thr Gly Lys Arp Pro Thr 
1085 1090 1095 



4251 



tec gee gec ggc tgc age etc cag gag ccc ggg cca ctg cgt gag 
Ser Ala Ala Gly Cys Ser Leu Gin Glu Pro Gly Pro Leu Arg Glu 
1100 1105 1110 



4296 



cgc gee cag agt gee tac etc cag ccc ggc ccc gcg gcg etc gaa 
Arg Ala Gin Ser Ala Tyr Leu Gin Pro Gly Pro Ala Ala Leu Glu 
1115 1120 1125 



4341 



ggc tec ggc etc gee teg gee 
Gly Ser Gly Leu Ala Ser Ala 
1130 1135 



tec age ttg age tea 
Ser Ser Leu Ser Ser 
1140 



ctg egg gag 
Leu Arg Glu 



4386 



ccc age ctg cag ccc cgc agg gag gee acg etc ctg ccg gec aca 
Pro Ser Leu Gin Pro Arg Arg Glu Ala Thr Leu Leu Pro Ala Thr 
1145 1150 1155 



4431 



gtt gca gaa ace cag cag get ccc cga gat cgc age age tec ttc 
Val Ala Glu Thr* Gin Gin Ala Pro Arg Asp Arg Ser Ser Ser Phe 
1160 1165 1170 



4476 



gee ggt ggc cgc cgc etc ggg gaa egg cga cgc ggg gac ctg ctt 
Ala Gly Gly Arg Arg Leu Gly Glu Arg Arg Arg Gly Asp Leu Leu 
1175 1180 1185 



4521 



age gga gca aac ggt gga aca agg ggc ace cag aga ggg gat gag 
Ser Gly Ala Asn Gly Gly Thr Arg Gly Thr Gin Arg Gly Asp Glu 
1190 1195 1200 



4566 



ace ccc agg gag cca tec tec tgg ggg gee agg gec ggg aag tec 
Thr Pro Arg Glu Pro Ser Ser Trp Gly Ala Arg Ala Gly Lys Ser 
1205 1210 1215 



4611 



atg teg gec gag gac ctg ctg gaa cgc teg gac gtc ctt gcg ggc 



4656 
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Met Ser Ala Glu Asp Leu Leu Glu Arg Ser Asp Val Leu Ala Gly 
1220 1225, 1230 

cct gtc cat gtg agg. tec agg tea tct ccc gec acc gca gac aag 4701 
Pro Val His Val Arg. Ser Arg Ser Ser Pro Ala Thr Ala Asp Lys 
1235 1240 1245 

cgc cag gat gtg ctt ttg ggg caa gac agt ggc ttt ggt ctt gtg 4746 
Arg Gin Asp Val Leu Leu Gly Gin Asp Ser Gly Phe Gly Leu Val 
1250 1255 1260 

aag gat cca tgt tat. ttg get ggt cct gga tct agg tea etc agt 4791 
Lys Asp Pro Cys Tyr Leu Ala Gly Pro Gly Ser Arg Ser Leu Ser 
1265 1270 1275 

tgt tea gaa aga ggc caa gaa gag atg ctg ctg etc ttc cac cat 4836 
Cys Ser Glu Arg Gly Gin Glu Glu Met Leu Leu Leu Phe His His 
1280 1285 1290 

etc acc cct cgt tgg ggt ggt tea ggc tgc aaa gee att ggt gat 4881 
Leu Thr Pro Arg Trp Gly Gly Ser Gly Cys Lys Ala lie Gly Asp 
1295 1300 1305 

tec tec gtt cct agt gaa tgt cct gga acc ctg gac cat cag agg 4926 
Ser Ser Val Pro Ser Glu Cys Pro Gly Thr Leu Asp His Gin Arg 
1310 1315 1320 

caa gee agt agg aca ccc tgc ccc agg cca cca ctg gca gga acg 4971 
Gin Ala Ser Arg- Thr Pro Cys • Pro Arg. Pro Pro Leu Ala Gly Thr 
1325 1330 1335 

caa ggg ctg gtc aca gac acc agg get gca ccc ctg acc cca att 5016 
Gin Gly Leu Val Thr Asp Thr Arg Ala Ala Pro Leu Thr Pro lie 
1340 1345 1350 

ggc acc cct ctg cct tea gee att ccc tct ggc tac tgc tea cag 5061 
Gly Thr Pro Leu Pro Ser Ala lie Pro Ser Gly Tyr Cys Ser Gin 
1355 1360 1365 

gac ggt cag aca ggg cga cag cct etc ccg ccc tac acc cct gee 5106 
Asp Gly Gin Thr Gly Arg Gin Pro Leu Pro Pro Tyr Thr Pro Ala 
1370 1375 ' 1380 

atg atg cac aga age aat ggt cac acc ctg acc cag cct ccc ggt 5151 
Met Met His Arg Ser Asn Gly His Thr Leu Thr Gin Pro Pro Gly 
1385 1390 1395 

cca aga ggc tgt gag ggc gat ggc cca gag cat ggg gta gaa gag 5196 
Pro Arg Gly Cys Glu Gly Asp Gly Pro Glu His Gly Val Glu Glu 
1400 1405 1410 

gga acg agg aag agg gtc teg ctg cct cag tgg cca cct cct tct 5241 
Gly Thr Arg Lys Arg Val Ser Leu Pro Gin Trp Pro Pro Pro Ser 
1415 1420 1425 

cga gca aag tgg gee cac gca gee aga gag gac age ctt cct gag 5286 
Arg Ala Lys Trp Ala His Ala Ala Arg Glu Asp Ser Leu Pro Glu 
1430 1435 1440 

gaa tec tea gee cct gat ttt gca aac ctg aag cac tat caa aaa 5331 
Glu Ser Ser Ala Pro Asp Phe Ala Asn Leu Lys His Tyr Gin Lys 
1445 1450 1455 

cag cag agt ctt cca agt tta tgc age act tct gac cca gac aca 5376 
Gin Gin Ser Leu Pro Ser Leu Cys Ser Thr Ser Asp Pro Asp Thr 
1460 1465 1470 

cct ctt ggg gee ccg age act cca ggg agg ate tec etc cga ata 5421 
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Pro Leu Gly Ala Pro Ser Thr Pro Gly Arg He Ser Leu Arg lie 
1475 1480 1485 



tct gag tct gtc ctg egg gac tec ccg cca cct cat gag gat tat 
Ser Glu Ser Val Leu Arg Asp Ser Pro Pro Pro His Glu Asp Tyr 
1490 1495 1500 



5466 



gaa gac gaa gtg ttt gtg agg gat ccg cac ccc aag gec acg tec 
Glu Asp Glu Val Phe Val Arg Asp Pro His Pro Lys Ala Thr Ser 
1505 1510 1515 



5511 



age ccc aca ttt gaa cct ctt 
Ser Pro Thr Phe Glu Pro Leu 
1520 1525 



ccc cca ccc cca cct cct cca ccg 
Pro Pro Pro Pro Pro Pro Pro Pro 
1530 



5556 



agt cag gaa acc ccg gtg tat age atg gat gac ttc cct cca cct 
Ser Gin Glu Thr Pro Val Tyr Ser Met Asp Asp Phe Pro Pro Pro 
1535 1540 1545 



5601 



cct ccc cac act gta tgt gag 
Pro Pro His Thr Val Cys Glu 
1550 1555 



gcg cag ctg gac agt gag gat ccc 
Ala Gin Leu Asp Ser Glu Asp Pro 
1560 



5646 



gag ggg cca cgc ccc age ttc aac aaa ctt tct aaa gtg aca att 
Glu Gly Pro Arg Pro Ser Phe .Asn Lys Leu Ser Lys Val Thr He 
1565 1570 1575 



5691 



gca agg gaa agg cac atg cct ggt gca gec cat gtg gta ggt agt 
Ala Arg Glu Arg His Met Pro Gly Ala Ala His Val Val Gly Ser 
1580 1585 1590 



5736 



cag aca ctg get tec aga etc caa act tct ate aag ggt tea gag 
Gin Thr Leu Ala Ser Arg Leu Gin Thr Ser He Lys Gly Ser Glu 
1595 1600 1605 



5781 



get gag tec aca cca ccc tec ttc atg age gtt cac gec caa .ctt 
Ala Glu Ser Thr Pro Pro Ser Phe Met Ser Val His Ala Gin Leu 
1610 1615 1620 



5826 



get ggg tct ctt ggt ggg cag cca gca- ccc ate cag act caa age 
Ala Gly Ser Leu Gly Gly Gin Pro Ala Pro He Gin Thr Gin Ser 
1625 1630 1635 



5871 



etc age cat gat cca gtc agt 
Leu Ser His Asp Pro Val Ser 
1640 1645 



gga act cag ggt tta 
Gly Thr Gin Gly Leu 
1650 



gaa aag aaa 
Glu Lys Lys 



5916 



gtc agt cct gat cct cag aag agt tea gaa gac ate aga aca gag 
Val Ser Pro Asp Pro Gin Lys Ser Ser Glu Asp He Arg Thr Glu 
1655 1660 1665 



5961 



get ttg gec aag gaa att gtc cac caa gac aaa tct eta gca gac 
Ala Leu Ala Lys Glu He Val His Gin Asp Lys Ser Leu Ala Asp 
1670 1675 » 1680 



6006 



att ttg gat cca gac tec agg ctg aag aca aca atg gac ctg atg 
He Leu Asp Pro Asp Ser Arg Leu Lys Thr Thr Met Asp Leu Met 
. 1685 1690 1695 



6051 



gaa ggt ttg ttt ccc cga gat gtg aac ttg ctg aag gaa aac agt 
Glu Gly Leu Phe Pro Arg Asp Val Asn Leu Leu Lys Glu Asn Ser 
1700 1705 1710 



6096 



gta aag agg aag gec ata cag 
Val Lys Arg Lys Ala He Gin 
1715 1720 



aga act gtc age tct 
Arg Thr Val Ser Ser 
1725 



tea gga tgt 
Ser Gly CyS 



6141 



gaa ggc aag agg aat gaa gac aag gaa gca gtg age atg ttg gtt 



6186 
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Glu Gly Lys Arg Asn Glu Asp Lys Glu Ala Val Ser Met Leu Val 
1730 1735 1740 



aac tgc cct gcc tac tac agt gtg tct get ccc aag get gag eta 
Asn Cys Pro Ala Tyr Tyr Ser Val Ser Ala Pro Lys Ala Glu Leu 
1745 1750 1755 



6231 



ctg aac aaa ate aaa gag atg 
Leu Asn Lys He Lys Glu Met 
1760 1765 



cca gca gaa gtg aat 
Pro Ala Glu Val Asn 
1770 



gaa cag gca gat gtc aat gaa aag aag get gag etc 
Glu Gin Ala Asp Val Asn Glu Lys Lys Ala Glu Leu 
1775 1780 1785 



gag gaa gag 
Glu Glu Glu 



att gga agt 
He Gly Ser 



6276 



6321 



etc ace cac aag ctg gag acc 
Leu Thr His Lys Leu Glu Thr 
1790 1795 



etc cag gag gcg aag ggg age ctg 
Leu Gin Glu Ala Lys Gly Ser Leu 
1800 



6366 



etc acg gac ate aag etc aac aac gcc ctg gga gaa 
Leu Thr Asp He Lys Leu Asn Asn Ala Leu Gly Glu 
1805 1810 1815 



gag gtg gag 
Glu Val Glu 



6411 



get ctg ate age gag etc tgc aag ccc aat gag ttt gac aag tat 
Ala Leu He Ser Glu Leu Cys Lys Pro Ash Glu Phe Asp Lys Tyr 
1820 1825 1830 



6456 



agg atg ttc ata ggg gat ttg gac aag gtg gtc aac ctg ctg etc 
Arg Met Phe He Gly Asp Leu Asp Lys Val Val Asn Leu Leu Leu 
1835 1840 1845 



6501 



tec etc teg ggg cgt eta gcc cgt gtt gag aat gtc ctt age ggc 
Ser Leu Ser Gly Arg Leu Ala Arg Val Glu Asn Val Leu Ser Gly 
1850 1855 i860 



6546 



ctt ggt gaa gat gcc agt aat gaa gaa agg age tct ctt tac gag 
Leu Gly Glu Asp Ala Ser Asn Glu Glu Arg Ser Ser . Leu Tyr Glu 
1865 1870 1875 



6591 



aaa agg aag ate ctg get ggt cag cat gag gat gcc egg gag ctg 
Lys Arg Lys He Leu Ala Gly Gin His Glu Asp Ala Arg Glu Leu 
1880 1885 1890 



6636 



aag gag aac ctg gat cgc agg gag cga gta gtg ctg ggc ate ttg 
Lys Glu Asn Leu Asp Arg Arg Glu Arg Val Val Leu Gly He Leu 
1895 1900 1905 



gcc aat tac ctt tea gag gag 
Ala Asn Tyr Leu Ser Glu Glu 
1910 1915 



cag etc cag gac tac cag cac ttc 
Gin Leu Gin Asp Tyr Gin His Phe 
• 1920 



6681 



6726 



gtg aaa atg aag tec acg etc etc att gag caa egg aag ctg gat 
Val Lys Met Lys Ser Thr Leu Leu He Glu Gin Arg Lys Leu Asp 
1925 1930 1935 



6771 



gac aag ate aag ctg ggc cag 
Asp Lys He Lys Leu Gly Gin 
1940 1945 



gag cag gtc aag tgt 
Glu Gin Val Lys Cys 
1950 



ctg ctg gag 
Leu Leu Glu 



6816 



age ctg ccc tea gat ttc att ccc aag. get ggg gcc ctg get ctg 
Ser Leu Pro Ser. Asp Phe He Pro Lys Ala Gly Ala Leu Ala Leu 
' 1955 I960 1965 



6861 



ccc cca aac etc acg agt gag ' ccc att cct get ggg ggc tgt act 
Pro Pro Asn Leu Thr Ser Glu Pro He Pro Ala Gly Gly Cys Thr 
1970 1975 1980 



6906 



ttc agt ggt att ttc cca aca tta acc tct cca ctt taacctcttc 



6952 
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Phe Ser Gly He Phe Pro Thr Leu Thr Ser Pro Leu 
1985 1990 1995 

taaaataccc aaccaaaaga tcactgtttc tctcaacact atttaatctg aaaaatgttt 7012 

cagtacaaac cactgtttga actatctggg ttattggtgt ttgttcctga tgaaaggaaa 7072 

aaaattctct ccaggaggaa gcctttttcc ttcttgccct tcctgattga tcttctgaga 7132 

gctcgaatgc tgctggacac gtaccccttt ctattattac tttgtagtag aaagaaagtt 7192 

aatgaaactg agaactgatt ggagggtgtt tgatcattta gtttttaaca ggctgaggca 7252 

acatggatca gtgtgtgtcc ccctcaggaa tgtatccaca gtggccttcc ttgctggtgg 7312 

gcagtgtatc. ctgatggcag ggtacaagta ccattaatga agggtctgca acataaagcc 7372 

ttaaaaagac acacactaag aaaactgtaa aaccttgaac attgttattt atatttttta 7432 

aaatggaaaa gatcactatg tttgttgtgc taaccactta tttgattctg ttttgtggtg 7492 

gacatagatg attacgtttg agctttgtat tttgtgaaaa ccttaatgaa atgaattcca 7552 
aagat 



7557 



<210> 38 

<211> 1995 

'<212> PRT 

<213> NM_020859 ShrmL, Shroom-related protein 

<400> 38 

Met Arg Thr Thr Glu Asp Phe His Lys Pro Ser Ala Thr Leu Asn Ser 
1 5 10 is 

Asn Thr Ala Thr Lys Gly Arg Tyr He Tyr Leu Glu Ala Phe Leu Glu 
20 25 30 

Gly Gly Ala Pro Trp Gly Phe Thr Leu Lys Gly Gly Leu Glu His Gly 
35 40 45 

Glu Pro Leu He He Ser Lys Val Glu Glu Gly Gly Lys Ala Asp Thr 
50 55 60 

Leu Ser Ser Lys Leu Gin Ala Gly Asp Glu Val Val His He Asn Glu 
65 70 75 80 

Val Thr Leu Ser Ser Ser Arg Lys Glu Ala Val Ser Leu Val Lys Gly 
85 90 95 

Ser Tyr Lys Thr Leu Arg Leu Val Val Arg Arg Asp Val Cys Thr Asp 
100 105 110 



Pro Gly His Ala Asp Thr Gly Ala Ser Asn Phe Val Ser Pro Glu His 
115 120 125 
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Leu Thr Ser Gly Pro Gin His Arg Lys Ala Ala Trp Ser Gly Gly Val 
130 " 135 ' 140 



Lys Leu Arg Leu Lys His Arg Ser Ser Glu Pro Ala Gly Arg Pro His 
145 150 155 160 



Ser Trp His Thr Thr Lys Ser Gly Glu Lys Gin Pro Asp Ala Ser Met 
165 170 175 



Met Gin lie Ser Gin Gly Met lie Gly Pro Pro Trp His Gin Ser Tyr 
180 ' 185 190 



His Ser Ser Ser Ser Thr Ser Asp Leu Ser Asn Tyr Asp His Ala Tyr 
195 200 ■ ■ . 205 



Leu Arg Arg Ser Pro Asp Gin Cys Ser Ser Gin Gly Ser Met Glu Ser 
210 215 220 



Leu Glu Pro Ser Gly Ala Tyr Pro Pro Cys His Leu Ser Pro Ala Lys 
225 230 235 240 



Ser Thr Gly Ser lie Asp Gin Leu Ser His Phe His Asn Lys Arg Asp 
245 250 255 



Ser Ala Tyr Ser Ser Phe Ser Thr Ser Ser Ser lie Leu Glu Tyr Pro 
260 265 270 



His Pro Gly lie Ser Ala Arg Glu Arg Ser Gly Ser Met Asp Asn Thr 
275 280 285 



Ser Ala Arg Gly Gly Leu Leu Glu Gly Met Arg Gin Ala Asp He Arg 
290 295 300 



Tyr Val Lys Thr Val Tyr Asp Thr Arg Arg Gly Val Ser Ala Glu Tyr 
305 310 315 320 



Glu Val Asn Ser Ser Ala Leu Leu Leu Gin Gly Arg Glu Ala Arg Ala 
325 330 335 



Ser Ala Asn Gly Gin Gly Tyr Asp Lys Trp Ser Asn He- Pro Arg Gly 
340 345 350 



Lys Gly Val Pro Pro Pro Ser Trp Ser Gin Gin Cys Pro Ser Ser Leu 
355 360 365 



Glu Thr Ala Thr Asp Asn Leu Pro Pro Lys Val Gly Ala Pro Leu Pro 
370 375 380 



Pro Ala Arg Ser Asp Ser Tyr Ala Ala Phe Arg His Arg Glu Arg Pro 
385 390 395 400 
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Ser Ser Trp Ser Ser Leu Asp Gin Lys Arg Leu Cys Arg Pro Gin Ala 
405 410 415 



Asn Ser Leu Gly Ser Leu Lys Ser Pro Phe lie Glu Glu Gin Leu His 
420 425 • 430 



Thr Val Leu Glu Lys Ser Pro Glu Asn Ser Pro Pro Val Lys Pro Lys 
435 440 445 



His Asn Tyr Thr. Gin Lys Ala Gin Pro Gly Gin Pro Leu Leu Pro Thr 
450 455 460 



Ser lie Tyr Ala Val Pro Ser Leu Glu Pro His Phe Ala Gin Val Pro 
465 470 475 480 



Gin Pro Ser Val Ser Ser Asn Gly Met Leu Tyr Pro Ala Leu Ala Lys 
485 490 495 



Glu Ser Gly Tyr He Ala Pro Gin Gly Ala Cys Asn Lys Met Ala Thr 
500 505 510 



He Asp Glu Asn Gly Asn Gin Asn Gly Ser Gly Arg Pro Gly Phe Ala 
515 520 525 



Phe Cys Gin Pro Leu Glu His Asp Leu Leu Ser Pro Val Glu Lys Lys 
■ 530 535 540 



Pro Glu Ala Thr Ala Lys Tyr Val Pro Ser Lys Val His Phe Cys Ser 
545 550 555 560 



Val Pro Glu Asn Glu Glu Asp Ala Ser Leu Lys- Arg His Leu Thr Pro 
565 570 575 



Pro Gin Gly Asn Ser Pro His Ser Asn Glu Arg Lys Ser Thr His Ser 
580 585 590 



Asn Lys Pro Ser Ser His Pro His Ser Leu Lys Cys Pro Gin Ala Gin 
595 600 605 



Ala Trp Gin Ala Gly Glu Asp Lys Arg Ser Ser Arg Leu Ser Glu Pro 
610 615 620 



Trp Glu Gly Asp Phe Gin Glu Asp His Asn Ala Asn Leu Trp Arg Arg 
625 630 635 640 



Leu Glu Arg Glu Gly Leu Gly Gin Ser Leu Ser Gly Asn Phe Gly Lys 
645 650 655 



Thr Lys Ser Ala Phe Ser Ser Leu Gin Asn lie Pro Glu Ser Leu Arg 
660 665 670 
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Arg His Ser Ser Leu Glu Leu Gly Arg Gly Thr Gin Glu Gly Tyr Pro 
675 680 • ** 685 



Gly Gly Arg Pro Thr Cys Ala Val Asn Thr Lys Ala Glu Asp Pro Gly 
690 695 700 



Arg Lys Ala Ala Pro Asp Leu Gly Ser His Leu Asp Arg Gin Val Ser 
705 710 715 720 



Tyr Pro Arg Pro Glu Gly Arg Thr Gly Ala Ser Ala Ser Phe Asn Ser 
725 730 735 



Thr Asp Pro Ser Pro Glu Glu Pro Pro Ala Pro Ser His Pro His Thr 
740 745 ■ 750 



Ser Ser Leu Gly Arg Arg Gly Pro Gly Pro Gly Ser Ala Ser Ala Leu 
755 . 760 765 



Gin Gly Phe Gin Tyr Gly Lys Pro His Cys Ser Val Leu Glu Lys Val 
770 775 780 



Ser Lys Phe Glu Gin Arg Glu Gin Gly Ser Gin Arg Pro Ser Val Gly 
785 790 795 800 



Gly Ser Gly Phe Gly His Asn Tyr Arg Pro His Arg Thr Val Ser Thr 
805 810 815 



Ser Ser Thr Ser Gly Asn Asp Phe Glu Glu Thr Lys Ala His He Arg 
820 825 830 



Phe Ser Glu Ser Ala Glu Pro Leu Gly Asn Gly Glu Gin His Phe Lys 
835 840 845 



Asn Gly Glu Leu Lys Leu Glu Glu Ala Ser Arg Gin Pro Cys Gly Gin 
850 855 860 



Gin Leu Ser Gly Gly Ala Ser Asp Ser Gly Arg Gly Pro Gin Arg Pro 
865 870 875 880 



Asp Ala Arg Leu Leu Arg Ser Gin Ser Thr Phe Gin Leu Ser Ser Glu 
885 890 895 



Pro Glu Arg Glu Pro Glu Trp Arg Asp Arg Pro Gly Ser Pro Glu Ser 
900 905 910 



Pro Leu Leu Asp Ala Pro Phe Ser Arg Ala Tyr Arg Asn Ser He Lys 
915 920 925 



Asp Ala Gin Ser Arg Val Leu Gly Ala Thr Ser Phe Arg Arg Arg Asp 
. 930 935 940 
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Leu Glu Leu Gly Ala Pro Val Ala Ser Arg Ser Trp .Arg Pro Arg Pro 
945 950 ' 955 960 



Ser Ser Ala His Val Gly Leu Arg Ser Pro Glu Ala Ser Ala Ser Ala 
965 970 975 



Ser Pro His Thr Pro Arg Glu Arg His Ser Val Thr Pro Ala Glu Gly 
980 985 990 



Asp Leu Ala Arg Pro Val Pro Pro Ala Ala Arg Arg Gly Ala Arg Arg 
995 1000 1005 



Arg Leu Thr Pro Glu Gin Lys Lys Arg Ser Tyr Ser Glu Pro Glu 
1010 1015 1020 



Lys Met Asn Glu Val Gly He Val Glu Glu Ala Glu Pro Ala Pro 
1025 1030 1-035 



Leu Gly Pro Gin Arg Asn Gly Met Arg .Phe Pro Glu Ser Ser Val 
1040 1045 1050 



Ala Asp Arg Arg Arg Leu Phe Glu Arg Asp Gly Lys Ala Cys Ser 
1055 1060 1065 



Thr Leu Ser Leu Ser Gly Pro. Glu Leu Lys Gin Phe Gin Gin Ser 
1070 1075 1080 



Ala Leu Ala Asp Tyr He Gin Arg Lys Thr Gly Lys Arg Pro Thr 
1085 1090 1095 



Ser Ala Ala Gly Cys Ser Leu Gin Glu Pro Gly Pro Leu Arg Glu 
1100 1105 1110 



Arg Ala Gin Ser Ala Tyr Leu Gin Pro Gly Pro Ala Ala Leu Glu 
1115 1120 1125 



Gly Ser Gly Leu Ala Ser Ala Ser Ser Leu Ser Ser Leu Arg Glu 
1130 1135 1140 



Pro Ser Leu Gin Pro Arg Arg Glu Ala Thr Leu Leu Pro Ala Thr 
1145 1150 1155 



Val Ala Glu Thr Gin Gin Ala Pro Arg Asp Arg Ser Ser Ser Phe 
1160 1165 1170 



Ala Gly Gly Arg Arg Leu Gly Glu Arg Arg Arg Gly Asp Leu Leu 
1175 1180 1185 



Ser Gly Ala Asn Gly Gly Thr Arg Gly Thr Gin Arg Gly Asp Glu 
1190 1195 1200 
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Thr Pro Arg Glu Pro Ser Ser Trp Gly Ala Arg Ala Gly Lys Ser 
. 1205 1210' 1215 



Met Ser Ala Glu Asp Leu Leu Glu Arg Ser Asp Val Leu Ala Gly 
1220 1225 * 1230 



Pro Val His Val Arg Ser Arg Ser Ser Pro Ala Thr Ala Asp Lys 
1235 1240 1245 



Arg Gin Asp Val Leu Leu Gly Gin Asp Ser Gly Phe Gly Leu Val 
1250 1255 1260 



Lys Asp Pro Cys Tyr Leu Ala Gly Pro Gly Ser Arg Ser Leu Ser 
1265 1270 1275 



Cys Ser Glu Arg Gly Gin Glu Glu Met Leu Leu Leu Phe His His 
. 1280 1285 i290 



Leu Thr Pro Arg Trp Gly Gly Ser Gly Cys Lys Ala lie Gly Asp 
1295 1300 . 1305 



Ser Ser Val Pro Ser Glu Cys Pro Gly Thr Leu Asp His Gin Arg 
1310 1315 1320 



Gin Ala Ser Arg Thr Pro Cys Pro Arg Pro Pro Leu Ala Gly Thr 
1325 1330 1335 



Gin Gly Leu Val Thr Asp Thr Arg Ala Ala Pro Leu Thr Pro He 
1340 1345 1350 



Gly Thr Pro Leu Pro Ser Ala He Pro Ser Gly Tyr Cys Ser Gin 
1355 1360 1365 



Asp Gly Gin Thr Gly Arg Gin Pro Leu Pro Pro Tyr Thr Pro Ala 
1370 1375 1380 



Met Met His Arg Ser Asn Gly His Thr Leu Thr Gin Pro Pro Gly 
1385 1390 1395 



Pro Arg Gly Cys Glu Gly Asp Gly Pro Glu His Gly Val Glu Glu 
1400 1405 1410 



Gly Thr Arg Lys Arg Val Ser Leu Pro Gin Trp Pro Pro Pro Ser 
1415 1420 1425 



Arg Ala Lys Trp Ala His Ala Ala Arg Glu Asp Ser Leu Pro Glu 
1430 1435 1440 



Glu Ser Ser Ala Pro Asp Phe Ala Asn Leu" Lys His Tyr Gin Lys 
1445 1450 1455 



o 



WO 2004/022778 PCT/AU2003/001166 

107 



Gin Gin Ser Leu Pro Ser Leu Cys Ser Thr Ser Asp Pro Asp Thr 
1460 1465 1470 



Pro Leu Gly Ala Pro Ser Thr Pro Gly Arg lie Ser Leu Arg He 
1475 1480 1485 



Ser Glu Ser Val Leu Arg Asp Ser Pro Pro Pro His Glu Asp Tyr 
1490 1495 1500 * 



Glu Asp Glu Val Phe Val Arg Asp Pro His Pro Lys Ala Thr Ser 
1505 1510 1515 



Ser Pro Thr Phe Glu Pro Leu Pro Pro Pro Pro Pro Pro Pro Pro 
1520 1525 1530 



Ser Gin Glu Thr Pro Val Tyr Ser Met Asp Asp Phe Pro Pro Pro 
1535 1540 • 1545 



Pro Pro His Thr Val Cys Glu Ala Gin Leu Asp Ser Glu Asp Pro 
1550 1555 1560 



Glu Gly Pro Arg Pro Ser Phe Asn Lys Leu Ser Lys Val Thr He 
1565 1570 1575 



Ala Arg Glu Arg' His Met Pro Gly Ala Ala His Val Val Gly Ser 
1580 1585 1590 



Gin Thr Leu Ala Ser Arg Leu Gin Thr Ser He Lys Gly Ser Glu 
1595 1600 1605 



Ala Glu Ser Thr Pro Pro Ser Phe Met Ser Val His Ala Gin Leu 
1610 1615 1620 



Ala Gly Ser Leu Gly Gly Gin Pro Ala Pro He Gin Thr Gin Ser 
1625 1630 1635 



Leu Ser His Asp Pro Val Ser Gly Thr Gin Gly Leu Glu Lys Lys 
1640 1645 1650 



Val Ser Pro Asp Pro Gin Lys Ser Ser Glu Asp He Arg Thr Glu 
1655 1660 1665 



Ala Leu Ala Lys Glu He Val His Gin Asp Lys Ser Leu Ala Asp 
1670 1675 1680 



He Leu Asp Pro Asp Ser Arg Leu Lys Thr Thr Met Asp Leu Met 
1685 1690 1695 



Glu Gly Leu Phe Pro Arg Asp Val Asn Leu Leu Lys Glu Asn Ser 
1700 1705 1710 
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Val Lys Arg Lys Ala He Gin Arg Thr Val Ser Ser Ser Gly Cys 
1715 1720 1725 

Glu Gly Lys Arg Asn Glu Asp Lys Glu Ala Val Ser Met Leu Val 
1730 1735 1740 

Asn cys Pro Ala Tyr Tyr Ser Val Ser Ala Pro Lys Ala Glu Leu 
17 *5 1750 1755 

Leu Asn Lys He Lys Glu Met Pro Ala Glu Val Asn Glu Glu Glu 
!760 1765 1770 

Glu Gin Ala Asp Val Asn Glu Lys Lys Ala Glu Leu He Gly Ser 
1775 1 7 80 1785 

Leu Thr His Lys Leu Glu Thr Leu Gin Glu Ala Lys Gly Ser Leu 
1790 1795 1800 

Leu Thr Asp He Lys Leu Asn Asn Ala Leu Gly Glu Glu Val Glu 
1805 isio i 8 i5 

Ala Leu He Ser Glu Leu Cys Lys Pro Asn Glu Phe Asp Lys Tyr 
1820 i 8 25 1830 

Arg Met Phe He Gly Asp Leu Asp Lys Val Val Asn Leu Leu Leu 
1835 1840 1845 

Ser Leu Ser Gly Arg Leu Ala Arg Val Glu Asn Val Leu Ser Gly 
1850 1855 i860 

Leu Gly Glu Asp Ala Ser Asn Glu Glu Arg Ser Ser Leu* Tyr Glu 
1865 1870 1875 

Lys Arg Lys He Leu Ala Gly Gin His Glu Asp Ala Arg Glu Leu 
1*80 1885 1890 



Lys Glu Asn Leu Asp Arg Arg Glu Arg Val Val Leu Gly He Leu 
1895 1900 1905 

Ala Asn Tyr Leu Ser Glu Glu Gin Leu Gin Asp Tyr Gin His Phe 
1910 1915 1920 

Val Lys Met Lys Ser Thr Leu Leu He Glu Gin Arg Lys Leu Asp 
1925 1930 1935 



Asp Lys He Lys Leu Gly Gin Glu Glh Val Lys Cys Leu Leu Glu 
1940 1945 1950 



Ser Leu Pro Ser Asp Phe He Pro Lys Ala Gly Ala Leu Ala Leu 
1955 i960 1965 
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Pro Pro Asn Leu Thr Ser Glu Pro lie Pro Ala Gly Gly Cys Thr 
1970 1975 1980 



Phe Ser Gly lie Phe Pro Thr Leu Thr Ser Pro Leu 
1985 1990 1995 



<210> 39 

<211> 2529 

<212> DNA . . 

<213> NM_004433 ELF3, E74~like factor 3 

<220> 

<221> CDS 

<222> (203) (1315) 

<223> 



<400> 39 

gagagagaga gagagagaga actagtctcg aggtcgacgc ggccgcgaat tcgcggccgc 60 

gtcgaccgaa gacgcaggct ctatttagag ccgggtaggg gagcgcacgg ccagatacct 120 

cagcgctacc tggcggaact ggatttctct cccgcctgcc ggcctgcctg ccacagccgg 180 

actccgccac tccggtagcc tc atg get gca acc tgt gag att age aac att 232 

Met Ala Ala Thr Cys Glu He Ser Asn He 
15 10 

ttt age aac tac ttc agt gcg atg tac age teg gag gac tec acc ctg 280 
Phe Ser Asn Tyr Phe Ser Ala Met Tyr Ser Ser Glu Asp Ser Thr Leu 
15 20 25 

gec tct gtt ccc cct get gee acq ttt ggg gee gat gac ttg gta ctg 328 
Ala Ser Val Pro Pro Ala Ala Thr Phe Gly Ala Asp Asp Leu Val Leu 
30 35 40 

acc ctg age aac ccc cag atg tea ttg gag ggt aca gag aag gec age 376 
Thr Leu Ser Asn Pro Gin Met Ser Leu Glu Gly Thr Glu Lys Ala Ser 
45 50 55 

tgg ttg ggg gaa cag ccc cag ttc tgg teg aag acg cag gtt ctg gac 424 
Trp Leu Gly Glu Gin Pro Gin Phe Trp Ser Lys Thr Gin Val Leu Asp 
60 65 70 

tgg ate age tac caa gtg gag aag aac aag tac gac gca age gee att 472 
Trp He Ser Tyr Gin Val Glu Lys Asn Lys Tyr Asp Ala Ser Ala He 
75 80 85 90 

gac ttc tea cga tgt gac atg gat ggc gee acc etc tgc aat tgt gec 520 
Asp Phe Ser Arg Cys Asp Met Asp Gly Ala Thr Leu Cys Asn Cys Ala 
95 . 100 105 

ctt gag gag ctg cgt ctg gtc ttt ggg cct ctg. ggg gac caa etc cat 568 
Leu Glu Glu Leu Arg Leu Val Phe Gly Pro Leu Gly Asp Gin Leu His 
110 115 120 
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gcc cag ctg cga gac etc act tec age tct tct gat gag etc agt tgg 616 
Ala Gin Leu Arg Asp Leu Thr Ser Ser Ser Ser Asp Glu Leu Ser Trp 
125 130 135 

ate att gag ctg ctg gag aag gat ggc atg gcc ttc cag gag gcc eta 664 
He He Glu Leu Leu Glu Lys Asp Gly Met Ala Phe Gin Glu Ala Leu 
140 145 150 

gac cca ggg ccc ttt gac cag ggc age ccc ttt gcc cag gag ctg ctg 712 
Asp Pro Gly Pro Phe Asp Gin Gly Ser Pro Phe Ala Gin Glu Leu Leu 
155 160 165 170 

gac gac ggt cag caa gcc age ccc tac cac ccc ggc age tgt ggc gca 760 
Asp Asp Gly Gin Gin Ala Ser Pro Tyr His Pro Gly Ser Cys Gly Ala 
175 180 185 

gga gcc ccc tec ccc ggc age tct gac gtc tec" ace gca ggg act ggt 808 
Gly Ala Pro Ser Pro Gly Ser Ser Asp Val Ser Thr Ala Gly Thr Gly 
190 195.' 200. 

get tct egg age tec cac tec tea gac tec ggt gga agt gac gtg gac 856 
Ala Ser Arg Ser Ser His Ser Ser Asp Ser Gly Gly Ser Asp Val Asp 
205 210 215 

ctg gat ccc act gat ggc aag etc ttc ccc age gat ggt ttt cgt gac 904 
Leu Asp Pro Thr Asp Gly Lys Leu Phe Pro Ser Asp Gly Phe Arg Asp 
220 225 230 

tgc aag aag ggg gat ccc aag cac ggg aag egg aaa cga ggc egg ccc 952 
Cys Lys Lys Gly Asp Pro Lys His Gly Lys Arg Lys Arg Gly Arg Pro 
235 240 245 250 

cga aag ctg age aaa gag tac tgg gac tgt etc gag ggc aag aag age 1000 
Arg Lys Leu Ser Lys Glu Tyr Trp Asp Cys Leu Glu Gly Lys Lys Ser 
255 260 265 

aag cac gcg ccc aga ggc ace cac ctg tgg gag ttc ate egg gac ate 1048 
Lys His Ala Pro Arg Gly Thr His Leu Trp Glu Phe He Arg Asp He 
270 275 280 

etc ate cac ccg gag etc aac gag ggc etc atg aag tgg gag aat egg 1096 
Leu He His Pro Glu Leu Asn Glu Gly Leu Met Lys Trp Glu Asn Arg 
285 290 295 

cat gaa ggc gtc ttc aag ttc ctg cgc tec gag get gtg gcc caa eta 1144 
His Glu Gly Val Phe Lys Phe Leu Arg -Ser .Glu Ala Val Ala Gin Leu 
300 305 310 

tgg ggc caa aag aaa aag aac age aac atg ace tac gag aag ctg age 1192 
Trp Gly Gin Lys Lys Lys Asn Ser Asn Met Thr Tyr Glu Lys Leu Ser 
315 320 325 330 

egg gcc atg agg tac tac tac aaa egg gag ate ctg gaa egg gtg gat 1240 
Arg Ala Met Arg Tyr Tyr Tyr Lys Arg Glu He Leu Glu Arg Val Asp 
335 340 345 



ggc egg cga etc gtc tac aag ttt ggc aaa aac tea age ggc tgg aag 
Gly Arg Arg Leu Val Tyr Lys Phe Gly Lys Asn Ser Ser Gly Trp Lys 
350 355 360 



1288 



gag gaa gag gtt etc cag agt egg aac tgagggttgg aactataccc 1335 
Glu Glu Glu Val Leu Gin Ser Arg Asn 
365 370. 

gggaccaaac tcacggacca ctcgaggcct gcaaaccttc ctgggaggac aggcaggeca 1395 

gatggcccct ccactgggga atgctcccag ctgtgctgtg gagagaagct gatgttttgg 1455 
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tgtattgtca gccatcgtcc ttggactcgg agactatggc ctcgcctccc caccctcctc 1515 

ttggaattac aagccctggg gtttgaagct gactttatag ctgcaagtgt atctcctttt 1575 

atctggtgcc tcctcaaacc cagtctcaga cactaaatgc agacaacacc ttcctcctgc 1635 

agacacttgg actgagccaa ggaggcttgg gaggccctag ggagcaccgt gatggagagg 1695 

acagagcagg ggctccagca cttctttctg gactggcgtt cacctccctg ctcagtgctt 1755 

gggctccacg ggcaggggtc agagcactcc ctaatttatg tgctatataa atatgtcaga 1815 

tgtacataga gatctatttt ttctaaaaca ttcccctccc cactcctctc ccacagagtg* 1875 

ctggactgtt ccaggccctc cagtgggctg atgctgggac ccttaggatg gggctcccag 1935 

♦ctcctttctc ctgtgaatgg aggcagagac ctccaataaa gtgccttctg ggctttttct 1995 

aacctttgtc ttagctacct gtgtactgaa atttgggcct ttggatcgaa tatggtcaag 2055 

aggttggagg ggaggaaaat gaaggtctac caggatgagg gtgagggcaa aggctgacga 2115 

agaggggagt tacagatttc ctgtagcagg tgtgggctta cagacacatg gactgggctg 2175 

ggaggcgagc aaaggaagca gctgagactg ttggagaacg ttacaagact tcatgcaaga 2235 

aggacatgaa ctcagaacac tgaggtcaga agcatctgct gtcatgacac cgctcgagtg 2295 

accttgacct tgaccaagtc ttgtccttgt ttaggactga tttttcctat taggctaggg 2355 

tttggacctg atgttctcaa gatgtctaga attgcatggc tggccttgtg gaatagatgg 2415 

ttttgcattc cagccaagtg tgctgtaaac tgtatatctg taatatgaat cccagctttt 2475 

gagtctgaca aaatcagagt taggatcttg taaaggaaaa aaaaaaaaaa aaaa 2529 

<210> 40 
<211> 371 
<212> PRT 

<213> NM 004433 ELF3, E74-like factor 3 



<400> 40 

Met Ala Ala Thr Cys Glu He Ser Asn He Phe Ser Asn Tyr Phe Ser 
1 5 10 15 



Ala Met Tyr Ser Ser Glu Asp Ser Thr Leu Ala Ser Val Pro Pro Ala 
20 25 30 

Ala Thr Phe Gly Ala Asp Asp Leu Val Leu Thr Leu Ser Asn Pro Gin 
35 * 40 45 

Met Ser Leu Glu Gly Thr Glu Lys Ala Ser Trp Leu Gly Glu Gin Pro 
50 55 60 



Gin Phe Trp Ser Lys Thr Gin Val Leu Asp Trp He Ser Tyr Gin Val 
65 " 70 75 80 
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Glu Lys Asn Lys Tyr Asp Ala Ser Ala lie Asp Phe Ser Arg Cys Asp 
85 90 95 



Met Asp Gly Ala Thr Leu Cys Asn Cys Ala Leu Glu Glu Leu Arg Leu 
100 105 HO 



Val Phe Gly Pro Leu Gly Asp Gin Leu His Ala Gin Leu Arg Asp Leu 
115 120 125 



Thr Ser Ser Ser Ser Asp Glu Leu Ser Trp lie lie Glu Leu Leu Glu 
130 135 140 



Lys Asp Gly Met Ala Phe Gin Glu Ala Leu Asp Pro Gly Pro Phe Asp 
145 150 155 160 



Gin Gly Ser Pro Phe Ala Gin Glu Leu Leu Asp Asp Gly Gin Gin Ala 
165 170 175 



Ser Pro Tyr His Pro Gly Ser Cys Gly Ala Gly Ala Pro Ser Pro Gly 
180 " 185 190 



Ser Ser Asp Val Ser Thr Ala Gly Thr Gly Ala Ser Arg Ser Ser His 
195 200 205 



Ser Ser Asp Ser Gly Gly Ser Asp Val Asp Leu Asp Pro. Thr Asp Gly 
210 215 220 



Lys Leu Phe Pro Ser Asp Gly Phe Arg Asp Cys Lys Lys Gly Asp Pro 
225 230 235 240 



Lys His Gly Lys Arg Lys Arg Gly Arg Pro Arg Lys Leu Ser Lys Glu 
245 250 255 



Tyr Trp Asp Cys Leu Glu Gly Lys Lys Ser Lys His Ala Pro Arg Gly 
260 265 270 



Thr His Leu Trp Glu Phe lie Arg Asp .He Leu He His Pro Glu Leu 
•275 280 285 



Asn Glu Gly Leu Met Lys Trp Glu Asn Arg His Glu Gly Val Phe Lys 
290 295 300 



Phe Leu Arg Ser Glu Ala Val Ala Gin Leu Trp Gly Gin Lys Lys Lys 
305 310 315 320 



Asn Ser Asn Met Thr- Tyr Glu Lys Leu Ser Arg Ala Met Arg Tyr Tyr 
325 330 335 



Tyr Lys Arg Glu .He Leu Glu Arg Val Asp Gly Arg Arg Leu Val Tyr 
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340 " 345 350 



Lys Phe Gly Lys Asn Ser Ser ' Gly Trp Lys Glu Glu Glu Val Leu Gin 

355 .360 365 



Ser Arg Asn 
370 



<210> 41 

<211> 4020 

<212> DNA 

<213> NM_019027 FLJ20273 
<220> 

<221> CDS 

<222> (240) (1811) 

<223> 



<400> 41 

gtcaattgcc ctttagtccc aggactaacc ggaagcttct gcaacaggag gacattgaaa 60 

ataagatgga acccatccac ataaggattt gcctcaaagg gcactgcaaa aattgaacag 120 

aggaatccca aggaagctgc ctgaatttgc ctgtatactc tcgttctgcg acttataaag 180 

gaccagacaa atcaaattag tggttttggt ttccgccagc tgtggatgcc tttgacatt 239 

atg acc gca gag gat tec acc gca gec atg age agt gac teg gee gee 287 
Met Thr Ala Glu Asp Ser Thr Ala Ala Met Ser Ser Asp Ser Ala Ala 
15 10 15 

ggg tec teg gec aag gtg ccc gag ggc gtg gcg ggc gcg ccc aac gag 335 
Gly Ser Ser Ala Lys Val Pro Glu Gly Val Ala Gly Ala Pro Asn Glu 
20 25 30 

gca gca ctg ctg gcg ctg atg gag cgc acg ggc tac age atg gtg caa 383 
Ala Ala Leu Leu Ala Leu Met Glu Arg Thr Gly Tyr Ser Met Val Gin 
35 40 45 

gag aac ggg cag cgc aag tac ggc ggc cca ccg ccc ggc tgg gag ggc 431 
Glu Asn Gly Gin Arg Lys Tyr Gly Gly Pro Pro Pro Gly Trp Glu Gly 
50 55 60 

ccg cac ccg cag cgt ggc tgc gag gtc ttc gtg ggc aag ate ccg cgc 479 
Pro His Pro Gin Arg Gly Cys Glu Val Phe Val Gly Lys lie Pro Arg 
65 70 75 80 

gac gtg tac gag gac gag ctg gtg ccc gtg ttc gag gec gtg ggc cgc 527 
Asp Val Tyr Glu Asp Glu Leu Val Pro Val Phe Glu Ala Val Gly Arg 
85 90 95 

acc tac gag ctg cgc etc atg atg gac ttt gac ggc aag aac cgc ggc 575 
Thr Tyr Glu Leu Arg Leu Met Met Asp Phe Asp Gly Lys Asn Arg Gly 
100 105 HO 
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tac gcc ttc gtc atg tac tgc cac aag cac gag gcc aag cgc gca gtg 623 
Tyr Ala Phe Val Met Tyr Cys His Lys His" Glu Ala Lys Arg Ala Val 
115 120 125 

cgt gag etc aac aac tac gag ate cgc ccg ggc cgc ctg etc ggc gtg 671 
Arg Glu Leu Asn Asn Tyr Glu He Arg Pro Gly Arg Leu Leu Gly Val 
130 135 140 

tgc tgc age gtg gac aac tgc cgc etc ttc ate ggc .ggg ate ccc aag 719 
Cys Cys Ser Val Asp Asn Cys Arg Leu Phe He Gly Gly He Pro Lys 
145 150 155 160 

atg aag aag cgc gag gaa ate ctg gag gag att gcc aag gtc acc gag 767 
Met Lys Lys Arg Glu Glu He Leu Glu Glu lie Ala Lys Val Thr Glu 
165 170 175 

ggc gtg ctg gac gtg ate gtc tac gcc age gcg gcc gac aag atg aag 815 
Gly Val Leu Asp Val lie Val Tyr Ala Ser Ala Ala Asp Lys Met Lys 
180 185 190 

aac cgc ggc ttc gcc ttc gtg gag tac gag age cac cgc gcg get gcc 863 
Asn Arg Gly Phe Ala Phe Val Glu Tyr Glu Ser His Arg Ala Ala Ala 
195 200 205 

atg get cgc cgc aag etc atg cct ggc cgc ate cag ctg tgg ggc cac 911 
Met Ala Arg Arg Lys Leu Met Pro Gly * Arg He Glh Leu Trp Gly His 
210 215 220 

cag ate gcc gtg gac tgg gcc gag cct gag ate gac gtg gac gag gac 959 
Gin lie Ala Val Asp Trp Ala Glu Pro Glu He Asp Val Asp Glu Asp - 
225 230 235 240 

gtg atg gag acc gtg aag ate etc tac gtg cgc aac etc atg ate gag 1007 
Val Met Glu Thr Val Lys He Leu Tyr Val Arg Asn Leu Met He Glu 
245 250 255 

acc acc gag gac acc ate aag aag age ttc ggc cag ttc aac ccc ggc 1055 
Thr Thr Glu Asp Thr He Lys Lys Ser Phe Gly Gin Phe Asn Pro Gly 
260 265 270 

tgc gtg gag cgc gtc aag aag ate cgc gac tac gcc ttc gtg cac ttc 1103 
Cys Val Glu Arg Val Lys Lys He Arg Asp Tyr Ala Phe Val His Phe 
275 280 285 

acc age cgc gag gat gcc gtg cat gcc atg aac aac etc aac ggc act 1151 
Thr Ser Arg Glu Asp Ala Val His Ala Met Asn Asn Leu Asn Gly Thr 
• 290 295 300 

gag ctg gag ggc teg tgc ctg gag gtc acg ctg gcc aag ccc gtg gac 1199 
Glu Leu Glu Gly Ser Cys Leu Glu Val Thr Leu Ala Lys Pro Val Asp 
305 310 315 320 

aag gag cag tac teg cgc tac cag aag gca gcc agg ggc ggc ggc gcg 1247 
Lys Glu Gin Tyr Ser Arg Tyr Gin Lys Ala Ala Arg Gly Gly Gly Ala 
325 330 335 

get gag gca gcg cag cag ccc age tac gtg tac tec tgc gac ccc tac 1295 
Ala Glu Ala Ala Gin Gin Pro Ser Tyr Val Tyr Ser Cys Asp Pro Tyr 
340 345 350 

aca ctg gcc tac tac ggc tac ccc tac aac gcg etc att ggg ccc aac 1343 
Thr Leu Ala Tyr Tyr Gly Tyr Pro Tyr Asn Ala Leu He Gly Pro Asn 
355 360 365 

agg gac tac ttt gtg aaa gta gcc ate cct gcc att ggg get cag tat 1391 
Arg Asp Tyr Phe Val Lys Val Ala He Pro Ala He Gly Ala Gin Tyr 
370 375 380 
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tec atg ttt cca gca get cca gec cct aaa atg att gaa.gat ggc aaa 1439 

Ser Met Phe Pro Ala Ala Pro Ala Pro Lys Met He Glu Asp Gly Lys' 
385 390 395 400 

ate cac aca gtg gag cac atg ate age ccc att get gtg cag cca gac 1487 

lie His Thr Val Glu His Met He Ser Pro He Ala Val Gin Pro Asp 
405 410 415 

cca gee agt get get gee gee gca gee gcg gec gca gec gec gca gec 1535 

Pro Ala Ser Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala 

420 425 430 

get gtc' att ccc act gtg teg acg cca cca cct ttc ,cag ggc cgc cca 1583 

Ala Val lie Pro Thr Val Ser Thr Pro Pro Pro Phe Gin Gly Arg Pro 
435 440 445 

ata act cca gta tac acg gtg get cca aac gtt cag aga att cct act 1631 

He Thr Pro Val Tyr Thr Val Ala Pro Asn Val Gin Arg He Pro Thr 
450 455 460 

gee ggg ate tac ggg gee agt tac gtg cca ttt get get cca get aca 1679 

Ala Gly He Tyr Gly Ala Ser Tyr Val Pro Phe Ala Ala Pro Ala Thr 
465 470 475 480 

gee acg ate gee aca eta cag aag aac gcg gca gee gcg gee gee gtg 1727 

Ala Thr He Ala Thr Leu Gin Lys Asn Ala Ala Ala Ala Ala Ala Val 
485 490 495 

tat gga gga tac gca ggc tac ata cct cag gee ttc cct get get gee 1775 

Tyr Gly Gly Tyr Ala Gly Tyr He Pro .Gin Ala Phe Pro Ala Ala Ala 

500 505 - 510 

att cag gtc ccc ate ccc gac gtc tac cag aca tac tgaggctggt 1821 

He Gin Val Pro He Pro Asp Val Tyr Gin Thr Tyr 
515 520 



gaccagcacg 


aagacagacc 


acacaaacac 


cactgaagga 


aegcttgact 


atttatgaag 


1881 


aaggaacatg 


ttggattcac 


acatgcaacc 


tgaaagtgaa 


gaatgttagc 


agatttattt 


1941 


ctgaattatt 


ttatatacat 


gaagttttca 


ctagtttttt 


aagactattt 


tcaacttagc 


2001 


atgcctacgt 


tcatacattt 


ccaaaagact 


tgcaatggtt 


cgtgccttca 


ttccatcttt 


2061 


taaaaatttg 


tatgetgtae 


tacatttgta 


tagaggtttt 


tgttgttgtt 


tttttaagga 


2121 


tatattttca 


gtatgaaggt 


tattttctta 


acttctgcac 


tccagagatt 


tctattttgt 


2181 


agtaccttca 


ataatatatc 


aactatatat 


taaaaaagca 


cacttgagga 


gctagggaac 


2241 


tattttgaaa 


aatatataca 


atatttaaag 


atacaaacag 


tagtgcttaa 


aaatactaca 


2301 


taaagcatta 


ttttaaaggt 


tatactggaa 


agtgcaattt 


taaaatgagt 


aaaacctctg 


2361 


tatttctget 


ggcattaagg 


gttgatggtg 


ttaccatgta 


tcatcatggc 


ggtactattt 


2421 


tttaaaagaa 


attaaacact 


ggatctctcc 


ttaagecaac 


attgaaaaga 


cttgccgcac 


2481 


ttctgagtcc 


aaacactgga 


aagctctcct 


tgccaccgtt 


ageegggget 


cattctccat 


2541 


gtgecttage 


cttaaacatg 


cccccactcc 


cacatctctc 


accctgtccc 


ctcctcccca 


2601 


gattcccaat 


cccaccgcaa 


tgtttggcaa 


gectaggact 


gataagtagc 


tctgatagag 


2661 


gagctggtgg 


cttttatact 


tcttcctggg 


tttttgttgg 


ggtttgttgt 


ttcgttgttt 


2721 


tttgtttttt 


ttttgtttgg 


ttggggaagt 


attgtcttct 


acgtgtgcta 


ttttcagtag 


2781 
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cagagtaagc 


acaaggtttt 


aatcgagttg cataagacac ctttgcatag ctatttaatt 


2841 


gcccaatgta 


aaactttaat 


gccatttcta atgcttttat tcatttttga agtatgagtt 


2901 


tgtagggaca 


aagaatgtat 


gttatcgtag acaagacccc cagagactct tttcagcaga 


2961 


aagttatgct 


tctagttgcc 


ttaccatgtt tcttgcaaaa ctgtccatgg tcctcaaggg 


3021 


tgttggaaac 


attatgttta 


ttaaatgggc ctctcttcct ttgctgtgca cttgatgggt 


3081 


gaactggatt 


ggggtgtgca 


catccaggag gaggaggaga gacctgtaga agtttaaaga 


3141 


tagtttgtaa 


atatcttcta 


atgcttgttt ttagtccttt tatgttggag aagttcatgg 


3201 


tatgtagttt 


aatgcaaaat 


gaaaccattt tatttcaatg ttattaaaaa ggtttgtttt 


3261 


attaggaagt 


taatgtattg 


ttgcagtgtt ttgtgcctgt ttaaaggctt ttgtttagca 


3321 


gagtgaatgt 


aaaatacagt 


aaaatgttaa gattgtcatc tactttttaa aaaaaaatat 


3381 


caacttggaa 


ttgtttttta 


aaggctcaat caaggaagtg aggtgtgcaa taaggtagca 


3441 


agtaaaacgc 


agttgcgttt 


ttatgtcatg ttagagatcc atacaatttt ccactcacgg 


3501 


gatttttgtt 


gatggctgaa 


ttcttgtgga ttcataagag gatcatgccc ttagcaagta 


3561 


cttttgtttt 


gttttaaatt 


aagagattcc caaatgcctt tttccccctc atcttgaaat 


3621 


gagatgagtt 


tttatgtgta 


agcaatattt atttaactat tctataaaat tattgagtgc 


3681 


ctactgaggc 


ctttaagcac 


cgctaacatt cctttccatc attcttttga atgacataaa 


3741 


ataattgtgc 


aatgttcctg 


atgatgtacc ccacagctgc attcaaactc aaatctgtgg 


3801 


gaatgagtga 


ctcgaccaaa 


tgtaattcgg atcagatcct catcccctga ctgtgtgaaa 


3861 


aaagtactct 


ccttctagtg 


aaggattgtc acagagtttc actggatgaa actatgaccc 


3921 


agtattctta 


ctgtatttta 


catgtgcctg taaattattt tgccgaaata agaagaagaa 


3981 


gaggaagaaa 


gaacagtaga 


aaaaaaaaat aaaaaaaaa 


4020 



<210> 42 
<211> 524 
<212> PRT 

<213> NMJ)19027 FLJ20273 
<400> 42 

Met Thr Ala Glu Asp Ser Thr Ala Ala Met Ser Ser Asp Ser Ala Ala 
15 10 15 

Gly Ser Ser Ala Lys Val Pro Glu Gly Val Ala Gly. Ala Pro Asn Glu 
20 25 30 

Ala Ala Leu Leu Ala Leu Met Glu Arg Thr Gly Tyr Ser Met Val Gin 
35 40 45 

Glu Asn Gly Gin Arg Lys Tyr Gly Gly Pro Pro Pro Gly Trp Glu Gly 
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50 55 60 

Pro His Pro Gin Arg Gly Cys Glu Val Phe Val Gly Lys lie Pro Arg 
65 70 75' 80 

Asp Val Tyr Glu Asp Glu Leu Val Pro Val Phe Glu Ala Val Gly Arg 
85 .90 95 

Thr Tyr Glu Leu Arg Leu Met Met Asp Phe Asp Gly Lys Asn Arg Gly 
100 105 110 

Tyr Ala Phe Val Met Tyr Cys His Lys His Glu Ala Lys Arg Ala Val 
115 120 125 

Arg Glu Leu Asn Asn Tyr Glu lie Arg pro Gly- Arg Leu Leu Gly Val 
130 135 140 

Cys Cys Ser Val Asp Asn Cys Arg Leu Phe lie Gly Gly He Pro Lys 
145 150 155 160 

Met Lys Lys Arg Glu Glu He Leu Glu Glu He Ala Lys Val Thr Glu 
165 • 170 175 

Gly Val Leu Asp Val He Val Tyr Ala Ser Ala Ala Asp Lys Met Lys 
180 185 . 190 

Asn Arg Gly Phe Ala Phe Val Glu Tyr Glu Ser His Arg Ala Ala Ala 
195 200 205 

Met Ala Arg Arg Lys Leu Met Pro Gly Arg He Gin Leu Trp Gly His 
210 215 220 

Gin He Ala Val Asp Trp Ala Glu Pro Glu He Asp Val Asp Glu Asp 
225 230 235 240 

Val Met Glu Thr Val Lys He Leu Tyr Val Arg Asn Leu Met He Glu 
245 250 255 

Thr Thr Glu Asp Thr He Lys Lys Ser Phe Gly Gin Phe Asn Pro Gly 
260 265 270 

Cys Val Glu Arg Val Lys Lys lie Arg Asp Tyr Ala Phe Val His Phe 
275 280 285 

Thr Ser Arg Glu Asp Ala Val His Ala Met Asn Asn Leu Asn Gly Thr 
290 295 300 

Glu Leu Glu Gly Ser Cys Leu Glu Val Thr Leu Ala Lys Pro Val Asp 
305 310 315 320 



Lys Glu Gin Tyr Ser Arg Tyr Gin Lys Ala Ala Arg Gly Gly Gly Ala 
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325 330 335 

Ala Glu Ala Ala Gin Gin Pro Ser Tyr Val Tyr Ser Cys Asp Pro Tyr 
340 345 350 

Thr Leu Ala Tyr Tyr Gly Tyr Pro Tyr Asn Ala Leu He Gly Pro Asn 
. 355 360 365 

Arg Asp Tyr Phe Val Lys Val Ala lie Pro Ala He Gly Ala Gin Tyr 
370 375 380 

Ser Met Phe Pro Ala Ala Pro Ala Pro Lys Met lie. Glu Asp Gly Lys 
385 390 '395 400 

He His Thr Val Glu His Met lie Ser Pro He Ala Val Gin Pro Asp 
405 410 415 



Pro Ala Ser Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala 
420 425 430 



Ala Val He Pro Thr Val Ser Thr Pro Pro Pro Phe Gin. Gly Arg Pro 
435 440 445 



He Thr Pro Val Tyr Thr Val Ala Pro Asn Val Gin Arg He Pro Thr 
450 455 460 



Ala Gly He Tyr Gly Ala Ser Tyr Val Pro Phe Ala Ala Pro Ala Thr 
465 470 475 480 



Ala Thr He Ala Thr Leu Gin Lys Asn Ala Ala Ala Ala Ala Ala Val 
485 * 490 - 495 



Tyr Gly Gly Tyr Ala Gly Tyr He Pro Gin Ala Phe Pro Ala Ala Ala 
500 505 510 



He Gin Val Pro He Pro Asp Val Tyr Gin Thr Tyr 
515 520 

<210> 43 

<211> 1451 

<212> DNA 

<213> NM 013952 PAX 8 



<220> 
<221> CDS 

<222> (161) (1354) 
<223> 
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<400> 43 

ttcagaagga ggagagacac cgggcccagg gcaccctcgc gggcgggcgg acccaagcag 60 

tgagggcctg cagccggccg gccagggcag cggcaggcgc ggcccggacc tacgggagga 120 

agccccgagc cctcggcggg ctgcgagcga ctccccggcg atg cct cac aac tec 175 

Met Pro His Asn Ser 
1 5 

ate aga tct ggc cat gga ggg ctg aac cag ctg gga ggg gee ttt gtg 223 
lie Arg Ser Gly His Gly Gly Leu Asn Gin Leu Gly Gly Ala Phe Val 
10 15 20 

aat ggc aga cct ctg ccg gaa gtg gtc cgc cag cgc ate gta .gac ctg 271 
Asn Gly Arg Pro Leu Pro Glu Val Val Arg Gin Arg He Val Asp Leu 
25 30 35 

gec cac cag ggt gta agg ccc tgc gac ate tct cgc cag etc cgc gtc 319 
Ala His Gin Gly Val Arg Pro Cys Asp He Ser Arg Gin Leu Arg Val 
40 45 50 

age cat ggc tgc gtc age aag ate ctt ggc agg tac tac gag act ggc 367 
Ser His Gly Cys Val Ser Lys He Leu Gly Arg Tyr Tyr Glu Thr Gly 
55 60 65 

age ate egg cct gga gtg ata ggg ggc tec aag ccc aag gtg gee ace 415 
Ser He Arg Pro Gly Val He Gly Gly Ser Lys Pro Lys Val Ala Thr 
70 75 80 85 

ccc aag gtg gtg gag aag att ggg gac tac aaa cgc cag aac cct acc 463 
Pro Lys Val Val Glu Lys He Gly Asp Tyr Lys Arg Gin Asn Pro Thr 
90 95 100 

atg ttt gec tgg gag ate cga gac egg etc ctg get gag ggc gtc tgt 511 
Met Phe Ala Trp Glu He Arg Asp Arg Leu Leu Ala Glu Gly Val Cys 
105 110 115 

gac aat gac act gtg ccc agt gtc age tec att aat aga ate ate egg 559 
Asp Asn Asp Thr Val Pro Ser Val Ser Ser He Asn Arg He He Arg 
120 125 130 

acc aaa gtg cag caa cca ttc aac etc cct atg gac age tgc gtg gee 607 
Thr Lys Val Gin Gin Pro Phe Asn Leu Pro Met Asp Ser Cys Val Ala 
135 140 145 

acc aag tec ctg agt ccc gga cac acg ctg ate ccc age tea get gta 655 
Thr Lys Ser Leu Ser Pro Gly His Thr Leu He Pro Ser Ser Ala Val 
150 155 160 165 

act ccc ccg gag tea ccc cag teg gat tec ctg ggc tec acc tac tec 703 
Thr Pro Pro Glu Ser Pro Gin Ser Asp Ser Leu Gly Ser Thr Tyr Ser 
170 175 180 

ate aat ggg etc ctg ggc ate get cag cct ggc age gac aag agg aaa 751 
He Asn Gly Leu Leu Gly He Ala Gin Pro Gly Ser Asp Lys Arg Lys 
185 190 195 

atg gat gac agt gat cag gat age tgc cga eta age att gac tea cag 799 
Met Asp Asp Ser Asp Gin Asp Ser Cys Arg Leu Ser He Asp Ser Gin 
200 205 210 

age age age age gga ccc cga aag cac ctt cgc acg gat gee ttc age 847 
Ser Ser Ser Ser Gly Pro Arg Lys His Leu Arg Thr Asp Ala Phe Ser 
215 220 225 
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cag cac cac etc gag ccg etc gag tgc cca ttt gag egg cag cac tac 895 
Gin His His Leu Glu Pro Leu Glu Cys Pro Phe Glu Arg Gin His Tyr 
230 235 240 245 

cca gag gec tat gec tec ccc age cac acc aaa ggc gag cag ggc etc 943 
Pro Glu Ala Tyr Ala Ser Pro Ser His Thr Lys Gly Glu Gin Gly Leu 
250 255 260 

tac ccg ctg ccc ttg etc aac age acc ctg gac gac ggg aag gee acc 991 
Tyr Pro Leu Pro Leu Leu Asn Ser Thr Leu Asp Asp Gly Lys Ala Thr 
265 270 275 

ctg acc cct tec aac acg cca ctg ggg cgc aac etc teg act cac cag 1039 
Leu Thr Pro Ser Asn Thr Pro Leu Gly Arg Asn Leu Ser Thr His Gin 
280 285 290 

acc tac ccc gtg gtg gca get ccg ccc ttt tgg ate tgc age aag teg 1087 
Thr Tyr Pro Val Val Ala Ala Pro Pro Phe Trp lie Cys Ser Lys Ser 
295 300 .305 

get ccg ggg tec cgc cct tea atg cct ttc ccc atg ctg cct ccg tgt 1135 
Ala Pro Gly Ser Arg Pro Ser Met Pro Phe Pro Met Leu Pro Pro Cys 
310 315 320 325 

acg ggc agt tea egg gee agg ccc tec. tct cag ggc gag aga tgg tgg 1183 
Thr Gly Ser Ser Arg Ala Arg Pro Ser Ser Gin Gly Glu Arg Trp Trp 
330 335 ' 340 

ggc cca cgc tgc ccg gat acc cac ccc aca tec cca cca gcg gac agg 1231 
Gly Pro Arg Cys Pro Asp Thr His Pro Thr Ser Pro Pro Ala Asp Arg 
345 350 355 

gca get atg cct cct ctg cca teg cag gca tgg tgg cag gaa gtg aat 1279 
Ala Ala Met Pro Pro Leu Pro Ser Gin Ala Trp Trp Gin Glu Val Asn 
360 365 370 

act ctg gca atg cct atg. gec aca ccc cct act cct cct aca gcg agg 1327 
Thr Leu Ala Met Pro Met Ala Thr Pro Pro Thr Pro Pro Thr Ala Arg 
375 380 385 

cct ggg get tec cca act cca get tgc tgagttcccc atattattac 1374 
Pro Gly Ala Ser Pro Thr Pro Ala Cys 
390 395 

agttccacat caaggecgag tgcaccgccc accactgcca eggectttga ccatctgtag 1434 
ttgccatggg gacagtg 1451 

<210> 44 

<211> 398 

<212> PRT 

<213> NM 013952 PAX 8 



<400> 44 

Met Pro His Asn Ser He Arg Ser Gly His Gly Gly Leu Asn Gin Leu 
15 10 15 



Gly Gly Ala Phe Val Asn Gly Arg Pro Leu Pro Glu Val Val Arg Gin 
20 25 30 
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Arg lie Val Asp Leu Ala His Gin Gly Val Arg Pro Cys Asp lie Ser 
35 40 45 

Arg Gin Leu Arg Val Ser His Gly Cys Val Ser Lys lie Leu Gly Arg 
50 55 60 

Tyr Tyr Glu Thr Gly Ser lie Arg Pro Gly Val He Gly Gly Ser Lys 
65 70 .75 80 

Pro Lys Val Ala Thr Pro Lys Val Val Glu Lys. He Gly Asp Tyr Lys 
85 90 95 

Arg Gin Asn Pro Thr Met Phe Ala Trp Glu He Arg Asp Arg Leu Leu 
100 105 110 

Ala Glu Gly Val Cys Asp Asn Asp Thr Val Pro Ser Val Ser Ser He 
115 120 125 

Asn Arg He He Arg Thr Lys Val Gin Gin Pro Phe Asn Leu Pro Met 
130 135 140 

Asp Ser Cys Val Ala Thr Lys Ser Leu Ser Pro Gly His Thr Leu He 
145 150 155 160 

Pro Ser Ser Ala Val Thr Pro Pro Glu Ser Pro Gin Ser Asp Ser Leu 
165 170 175 

Gly Ser Thr Tyr Ser He Asn Gly Leu Leu Gly He Ala Gin Pro Gly 
180 185 190 

Ser Asp Lys Arg Lys Met Asp Asp Ser Asp Gin Asp Ser Cys Arg Leu 
195 200 205 

Ser He Asp Ser Gin Ser Ser Ser Ser Gly Pro Arg Lys His Leu Arg 
' 210 215 220 

Thr Asp Ala Phe Ser Gin His His Leu Glu Pro Leu Glu Cys Pro Phe 
225 230 235 240 

Glu Arg Gin His Tyr Pro Glu Ala Tyr Ala Ser Pro Ser His Thr Lys 
245 250 255 

Gly Glu Gin Gly Leu Tyr Pro Leu Pro Leu Leu Asn Ser Thr Leu Asp 
260 265 270 

Asp Gly Lys Ala Thr Leu Thr Pro Ser Asn Thr Pro Leu Gly Arg Asn 
275 280 285 



Leu Ser Thr His Gin Thr Tyr Pro Val Val Ala Ala Pro Pro Phe Trp 
290 295 300 
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Il'e 
305 



Cys Ser Lys Ser Ala Pro Gly Ser Arg Pro Ser Met Pro Phe Pro 
310 315 320 



Met 



Leu Pro Pro Cys Thr Gly *Ser Ser Arg Ala Arg Pro Ser Ser Gin 
325 ' 330 335 



Gly 



Glu Arg Trp Trp Gly Pro Arg Cys Pro Asp Thr His Pro Thr Ser 
340 345 350 



Pro 



Pro Ala Asp Arg Ala Ala Met Pro Pro Leu Pro Ser Gin Ala Trp 
355 360 365 



Trp 



Gin Glu Val Asn Thr Leu Ala Met Pro Met Ala Thr Pro Pro Thr 
370 375 380 



Pro 
385 



Pro Thr Ala Arg Pro Gly Ala Ser Pro Thr Pro Ala Cys 
390 395 



<210> 45 

<211> 326 

<212> DNA 

<213> AI301558 EST 



<400> 45 



tttctttttt 


gttacattca 


tttgattcag tcccttataa accccacacc tcataaacaa 


60 


gagattagaa 


actaaacaaa 


aaggggggcg gggaaggaaa ttctagagtc gttctggttt 


120 


gcaggtggtt 


gcggtcacaa 


agagaaatca tcaagaatgt tcacttggca tgtgtgaaag 


180 


- attcaggggg 


tctgcagctg 


tttagtgttg atgcagttgg gtcaaaagag tatcatgtta 


240 


gtcttctgtg 


ggttttaggg 


agggattatg gagcctccct cccaccccac tggctttctt 


300 


gtgtcacagc 


ctttatttct 


actccg 


326 



<210> 46 

<2U> 1534 

<212> DNA 

<213> NM_018000 FLJ10116 
<220> 

<221> CDS 

<222> (267) (938) 

<223> 
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<400> 46 

tttgcgctcg gggaattaaa agaggggaaa aaaagcccga agaaaactca cgccccaaac 60 

aaaacgcaag gagaggaggg cgcgcggcct gcagccctcg cccgcgtccc cggccgcggc 120 

gtgatgcgcg cggaccagcc cgcgacgccc gggctgccgc tgtccccgca cctggacgct 180 

ggcgcggtgg ccgcgcccca gcctcgatcg ctcgccgcgg cgactcggcc ccaggcttcc 240 

ggcgccggtg ggggccctcg ctctcc atg ggg ctg agg gac tgg ctg aga acc 293 

Met Gly Leu Arg Asp Trp Leu Arg Thr 
1 5 

gtg tgc tgc tgc tgc egg tgc gag tgc ttg gag gag cgc gec ctg cct 341 
Val Cys Cys Cys Cys Arg Cys Glu Cys Leu Glu Glu Arg Ala Leu Pro 
10 15 20 25 

gag aag gag ccc etc gtc agt gat aac aat cca tat tec tea ttt gga 389 
Glu Lys Glu Pro Leu Val Ser Asp Asn Asn Pro Tyr Ser Ser Phe Gly 
30 35 40 

gca act ctg gtg agg gat gat gag aag aat tta tgg agt atg ccc cat 437 
Ala Thr Leu Val Arg Asp Asp Glu Lys Asn Leu Trp Ser Met Pro His 
45 50 55 

gat gtg tec cac aca gag gca gac gac gac aga acc ctg tac aat ttg 485 
Asp Val Ser His Thr Glu Ala Asp Asp Asp Arg Thr Leu Tyr Asn Leu 
60 65 70 

ata gtc att cgt aat cag cag gee aaa gac tea gag gag tgg cag aag 533 
lie Val lie Arg Asn Gin Gin Ala Lys Asp Ser Glu .Glu Trp Gin Lys 
75 80 85 

etc aac tat gat ate cat acc ctg egg cag gtt cga agg gaa gta aga 581 
Leu Asn Tyr Asp lie His Thr Leu Arg Gin Val Arg Arg Glu Val Arg 
90 95 100 105 

aac aga tgg aag tgc ate tta gaa gat tta ggt ttt caa aag gaa get 629 
Asn Arg Trp Lys Cys lie Leu Giu Asp Leu Gly Phe Gin Lys Glu Ala 
110 115 120 

gac tct ttg ttg tea gtg act aaa etc age acc ate agt gat tct aaa 677 
Asp Ser Leu Leu Ser Val Thr Lys Leu Ser Thr He Ser Asp Ser Lys 
125 130 135 

aac aca agg aaa get cga gag atg ttg tta aaa ctg get gaa gaa acc 725 
Asn Thr Arg Lys Ala Arg Glu Met Leu Leu- Lys Leu Ala Glu Glu Thr 
140 145 150 

agt att ttc cca aca agt tgg gag etc tea gag aga tat etc ttt gtt 773 
Ser He Phe Pro Thr Ser Trp Glu Leu Ser Glu Arg Tyr Leu Phe Val 
155 160 165 

gtg gac cgt etc att gca ctt gat get gca gaa gag ttc ttt aag ctt 821 
Val Asp Arg Leu He Ala lieu Asp Ala Ala Glu Glu Phe Phe Lys Leu 
170 175 180 185 

get cgt cga act tac ccc aag aag cct ggg gtt cca tgc ctg gca gat 869 
Ala Arg Arg Thr Tyr Pro Lys Lys Pro Gly Val Pro Cys Leu Ala Asp 
190 195 200 

ggc cag aaa gaa ctg cac ctg tgg ggg gac etc tea tgc aga ctt gca 917 
Gly Gin Lys Glu Leu His Leu Trp Gly Asp Leu Ser Cys Arg Leu Ala 
205 210 215 

cat atg cag gga gta ttg cac tgaagatctt tgctggacct tcttctcttc 968 
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His Met Gin Gly Val Leu His 
220 

agaagataat tttcaaaagg gagcaatgct gtgaatgcag cttgcttctc tctacagatt 1028 

gagaagtcca gcttcaaaag ttacttgcca cttaagcaag gaacttgtca agagatcatg 1088 

• gttcatgtta ctgaaaagac tttaaggatt tgtaaggtta atccatagat tgctgagaac 1148 

aatggaaata tttttatttt tacagatttt gcacttctga attcaggtta aaaactaact 1208 

trgtatttagt ctgcttagag gactgtgact tgaaaatttt tatataccaa tgagcttttt 1268 

ggtagcgtcc acaatgttta aaatatttca taggcgagat ccgtgttctc catttattaa 1328 

tgcattgtag accaatttaa ctgctgtgtt tcaggaaaat tc'ttcctagt ttaataagca 1388 

agctaaaagt tttatttttt atatttagtg cttaatcttt gcctcatgtt atgtaaaatt 1448 

agcctgcaga tattttctct caattctgta gactctcgca agataaacat tcaaacagtg 1508 

aaacaaacaa taaaataaat aaacct 1534 

<210> 47 

<211> 224 

<212> PRT 

<213> NMJU8000 FLJ10116 

<400> 47 

Met Gly Leu Arg Asp Trp Leu Arg Thr Val Cys Cys Cys Cys Arg Cys 
1 5 10 15 

Glu Cys Leu Glu Glu Arg Ala Leu Pro Glu Lys Glu Pro Leu Val Ser 
20 25 30 

Asp Asn Asn Pro Tyr Ser Ser Phe Gly Ala Thr Leu Val Arg Asp Asp 
35 40 45 

Glu Lys Asn Leu Trp Ser Met Pro His Asp Val Ser His Thr Glu Ala 
50 55 60 

Asp Asp Asp Arg Thr Leu Tyr Asn Leu He Val He Arg Asn Gin Gin 
65 70 75 80 

Ala Lys Asp Ser Glu Glu Trp Gin Lys Leu Asn Tyr Asp He His Thr 
85 90 95 

Leu Arg Gin Val Arg Arg Glu Val Arg Asn Arg Trp Lys Cys He Leu 
100 105 110 



Glu Asp Leu Gly Phe Gin Lys Glu Ala Asp Ser Leu Leu Ser Val Thr 
115 120 125 
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Lys Leu Ser Thr lie Ser Asp Ser Lys Asn Thr Arg Lys Ala Arg Glu 
130 135 140 



Met Leu Leu Lys Leu Ala Glu Glu Thr Ser lie Phe Pro Thr Ser Trp 
145 150 155 i 6 o 



Glu Leu Ser Glu Arg Tyr Leu Phe Val Val Asp Arg Leu He Ala Leu 
165 170 175 



Asp Ala Ala Glu Glu Phe Phe Lys Leu Ala Arg Arg Thr Tyr Pro Lys 
180 185 , 190 



Lys Pro Gly Val Pro Cys Leu Ala Asp Gly Gin Lys Glu Leu His Leu 
195 200 205 



Trp Gly Asp Leu Ser Cys Arg Leu Ala His Met Gin Gly Val Leu His 
210 215 220 



<210> 48 

<211> 2385 

<212> DNA 

<213> NM_144724 FLJ30532 
<220> 

<221> CDS 

<222> (71) ,.(1441) 

<223> • 



<400> 48 

gataccggtt tcagagtcct gggcagcgtg cgcgctcttc ctggcggctg cgcaggtgtg 60 

aaaatcacaa atg tea aat gat gga aga tec agg aat egg gac agg cgc 109 
Met Ser Asn Asp Gly Arg Ser Arg Asn Arg Asp Arg Arg 
1 5 10 

tac gat gag gtc cca age gac ctg ccc tat caa gat acc acc ata aga 157 
Tyr Asp Glu Val Pro Ser Asp Leu Pro Tyr Gin Asp Thr Thr lie Arg 
15 20 25 

acc cac cca att ctt cat gac agt gag egg gca gtg age get gat ccc 205 
Thr His Pro He Leu His Asp Ser Glu Arg Ala Val Ser Ala Asp Pro 
30 35 40 45 

ttg cca cca ccc cct etc cca tta cag cca cca ttc ggc cca gac ttc 253 
Leu Pro Pro Pro Pro Leu Pro Leu Gin Pro Pro Phe Gly Pro Asp Phe 
50 55 60 

tac tea agt gac aca gaa gaa cca get ata gcg cca gat etc aaa cca 301 
Tyr Ser Ser Asp Thr Glu Glu Pro Ala He Ala Pro Asp Leu Lys Pro 
65 70 75 

gta agg cgc ttt gtc cct gac tec tgg aag aac ttt ttc aga ggg aag 349 
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Val Arg Arg Phe Val Pro Asp Ser Trp Lys Asn Phe Phe Arg Gly Lys 
80 85 90 

aaa aag gac ccc gaa tgg gat aag ccg gtg tct gat ate agg tac ate 397 
Lys Lys Asp Pro Glu Trp Asp Lys Pro Val Ser Asp lie Arg Tyr He 
95 100 105 

tec gat gga gtg gag tgt tea cca cca gee tct cca gca aga cca aac 445 
Ser Asp Gly Val Glu Cys Ser Pro Pro Ala Ser Pro Ala Arg Pro Asn 
110 115 120 125 

cac cgt teg ccc etc aac tec tgc aaa gat ccc tac gga ggg tea gaa 493 
His Arg Ser Pro Leu Asn Ser Cys Lys Asp Pro Tyr Gly Gly Ser Glu 
130 135 140 

gga ace ttt agt tec egg aaa gag get gac gca gtg ttt ccc egg gat 541 
Gly Thr Phe Ser Ser Arg Lys Glu Ala Asp Ala Val Phe Pro Arg Asp 
145 150 155 

ccc tat gga tct eta gac cga cac aca caa aca gtt cga aca tac agt 589 
Pro Tyr Gly Ser Leu Asp Arg His Thr Gin Thr Val Arg Thr Tyr Ser 
160 165 170 

gag aag gtg gag gag tat aac ctg aga tac tec tac atg aag teg tgg 637 
Glu Lys Val Glu Glu Tyr Asn Leu Arg Tyr Ser Tyr Met Lys Ser Trp 
175 180 185 

gca ggc ctg ctg aga ata ctg ggt gtg gtg gag ctg ctt ttg ggg gee 685 
Ala Gly Leu Leu Arg He Leu Gly Val Val Glu Leu Leu Leu Gly Ala 
190 195 200 205 

ggt gtc ttt get tgt gtc aca get tac att cac aag gac agt gag tgg 733 
Gly Val Phe Ala Cys Val Thr Ala Tyr He His Lys Asp Ser Glu Trp 
210 215 220 

tac aac ttg ttt gga tat tea caa ccg tat ggc atg gga ggc gtt ggt 781 
Tyr Asn Leu Phe Gly Tyr Ser Gin Pro Tyr Gly Met Gly Gly Val Gly 
225 230 ' 235 

gga ttg ggc agt atg tat ggg ggc tat tac tac act ggc cct aag acc 829 
Gly Leu Gly Ser Met Tyr Gly Gly Tyr Tyr Tyr Thr Gly Pro Lys Thr 
240 . 245 " 250 

cct ttt gta etc gtg gtt get gga tta get tgg ate acc acc att att 877 
Pro Phe Val Leu Val Val Ala Gly Leu Ala Trp He Thr Thr He He 
255 r 260 265 

att ctg gtt ctt ggc atg tec atg tat tac egg acc att ctt ctg gac 925 
He Leu Val Leu Gly Met Ser Met Tyr Tyr Arg Thr He Leu Leu Asp 
270 275 280 . 285 

tct aat tgg tgg ccc eta act gaa ttt gga att aac gtt gee ttg ttt 973 
Ser Asn Trp Trp Pro Leu Thr Glu Phe Gly He Asn Val Ala Leu Phe 
290 295 300 

att ttg tat atg gee gca gee ata gtc tat gtg aat gat acc aac cga 1021 
He Leu Tyr Met Ala Ala Ala He Val Tyr Val Asn Asp Thr Asn Arg 
305 310 315 

ggt ggc etc tgc tac tat ccg tta ttt aat aca cca gtg aat gca gtg 1069 
Gly. Gly Leu Cys Tyr Tyr Pro Leu Phe Asn Thr Pro Val Asn Ala Val 
320 325 330 

ttc tgc egg gta gaa gga gga cag ata get gca atg ate ttc ctg ttt 1117 
Phe Cys Arg Val Glu Gly Gly Gin He Ala Ala Met He Phe Leu Phe 
335 340 345 

gtc acc atg ata gtt tat etc att agt get ttg gtt tgc eta aag tta 1165 
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Val Thr Met lie Val Tyr Leu lie Ser Ala Leu 
350 355 360 

tgg agg cat gag gca get egg aga cat aga gaa 
Trp Arg His Glu Ala Ala Arg Arg His Arg Glu 
•370 ^ 375 

gag ata aat gag cca tea ttg tea teg aaa agg 
Glu He Asn Glu Pro Ser Leu Ser Ser Lys Arg 
385 390 

gee ace agt ggt gac aga caa aga gac tea gaa 
Ala Thr Ser Gly Asp Arg Glh Arg Asp Ser Glu 
400 405 

ctg aga aca gca aaa atg aaa cct gaa eta ctg 
Leu Arg Thr Ala Lys Met Lys Pro Glu Leu Leu 
415 420 " 

cca cgc cca get aat ttt ttt gta ttt tta gta 
Pro Arg Pro" Ala Asn Phe Phe Val Phe Leu Val 
430 435 440 

cgt gtt age cag gat gat etc gat etc ctg ace 
Arg Val Ser Gin Asp Asp Leu Asp Leu Leu Thr 



Val Cys Leu Lys Leu 
365 

tat atg gaa caa cag 1213 
Tyr Met Glu Gin Gin 
380 

aaa atg tgt gaa atg 1261 
Lys Met Cys Glu Met 
395 

gtt aat ttc aag gaa 1309 
Val Asn Phe Lys Glu 
410 

agt gga cac ate ccc 1357 

Ser Gly His He Pro 

425 

gag atg ggg ttt cac 1405 
Glu Met Gly Phe His 
445 

tea tgatccaccc 1451 
Ser 





450 




455 




gcctcagcct 


cccaaagtgt 


tgggattaca 


ggcgtgagtc accgcgccca gctggtattg 


1511 


cttttctatt 


ccctttggac 


atacatgeta 


cagtcccaca atgtagcatt tccttggaaa 


1571 


ctcccttttt 


tttttttttt 


gagatggagt 


ttcgctcttg ttgcccaggc tggagtacag 


1631 


tcjgtatgatc 


ttggctcact 


gcagcctctg 


cctcctgggt teaagegatt ctcctgcctc 


1691 


tgcctcccaa 


gtagctggga 


ttacaggcac 


ccaccaccat gcccagctaa ttttttgtat 


1751 


ttttagtaga 


gacaggattt 


cactatgttg 


gccaggttgg tctcaagctc ctgacctcag 


1811 


atgatctacc 


agcctcggcc 


ttctgaagtg 


ctgggattca ggtgtgagcc actgtgccca 


1871 


geagggatge 


ttcatctttc 


taagaattat 


cttggctttg gactttattc ataaatgttt 


1931 


tatttctgtt 


agtatgaaca 


atagactgee 


ttaacaaagt ttttttttaa acaaaatcgt 


1991 


tcttgttgga 


ttttattcag 


cagcatctat 


catgtagata aattcccagg tgtagcatta 


2051 


cagcttctga 


ctaatatagc 


tgccattcag 


acaattaatg ttcaaagagt tttctaaagt 


2111 


gataaaacca 


aagaaaagca 


tgtggaaaag 


cagaagctta gaaagttgtg gtcactgaat 


2171 


gcactccctg 


gtttttattt 


gtcagtgaaa 


tetttatgea ttcattgtta atattttaat 


2231 


tccatggctt 


tgtaggctgt 


gctgtgtctg 


aaggggtaac acctagggaa acatgaggee 


2291 


ccttatggga 


ccccccaaat 


ggaacaactt 


cactttctct tttatgtatt gagccctgtg 


2351 


ttaacatttc 


acttaagaag 


agcaccagtg 


cttt 


2385 



<210> 49 

<211> 457 

<212> PRT 

<213> NM_144724 FLJ30532 
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<400> 49 

Met Ser Asn Asp Gly Arg Ser Arg Asn Arg Asp Arg Arg Tyr Asp Glu 
1 5 10 15 



Val Pro Ser Asp Leu Pro Tyr Gin Asp- Thr Thr lie Arg Thr His Pro 
20 25 30 



lie Leu His Asp Ser Glu Arg Ala Val Ser Ala Asp Pro Leu Pro Pro 
35 40 • 45 



Pro Pro Leu Pro Leu Gin Pro Pro Phe Gly Pro Asp Phe Tyr Ser Ser 
50 55 60 



Asp Thr Glu Glu Pro Ala lie Ala Pro Asp Leu Lys Pro Val Arg Arg 
65 70 • 75 80 



Phe Val Pro Asp Ser Trp Lys Asn Phe Phe Arg Gly Lys Lys Lys Asp 
85 90 95 



Pro Glu Trp Asp Lys Pro Val Ser Asp lie Arg Tyr lie Ser Asp Gly 
100 105 110 



Val Glu Cys Ser Pro Pro Ala Ser Pro Ala Arg Pro Asn His Arg Ser 
115 120 125 



Pro Leu Asn Ser Cys Lys Asp Pro Tyr Gly Gly Ser Glu Gly Thr Phe 
130 135 140 



Ser Ser Arg Lys Glu Ala Asp Ala Val Phe Pro Arg Asp Pro Tyr Gly 
145 150 155 . 160 



Ser Leu Asp Arg His Thr Gin Thr Val Arg Thr Tyr Ser Glu Lys Val 
165 170 175 



Glu Glu Tyr Asn Leu Arg Tyr Ser Tyr Met Lys Ser Trp Ala Gly Leu 
180 ~ 185 190 



Leu Arg He Leu Gly Val Val Glu Leu Leu Leu Gly Ala Gly Val Phe 
195 * 200 205 



Ala Cys Val Thr Ala Tyr He His Lys Asp Ser Glu Trp Tyr Asn Leu 
210 215 220 



Phe Gly Tyr Ser Gin Pro Tyr Gly Met Gly Gly Val Gly Gly Leu Gly 
225 230 235 240 



Ser Met Tyr Gly Gly Tyr Tyr Tyr Thr Gly Pro Lys Thr Pro Phe Val 
245 250 255 
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Leu Val Val Ala Gly Leu Ala Trp lie Thr Thr lie lie He Leu Val 
260 265 270 



Leu Gly Met Ser. Met Tyr' Tyr Arg Thr lie Leu Leu Asp Ser Asn Trp 
275 280 . 285 



Trp Pro Leu Thr Glu Phe Gly He Asn Val Ala Leu Phe He Leu Tyr 
290 295 300 



Met Ala Ala Ala He Val Tyr Val Asn Asp Thr Asn Arg Gly Gly Leu 
305 310 315 320 



Cys Tyr Tyr Pro Leu Phe Asn Thr Pro Val Asn Ala Val Phe Cys Arg 
325 330 335 



Val Glu Gly Gly Gin He Ala Ala Met He Phe Leu Phe Val Thr Met 
340 345 350 



He Val Tyr Leu He Ser Ala Leu Val Cys Leu Lys Leu Trp Arg His 
355 360 365 



Glu Ala Ala Arg Arg His Arg Glu Tyr Met Glu Gin Gin Glu He Asn 
370 375 380 



Glu Pro Ser Leu Ser Ser Lys Arg Lys Met Cys Glu Met Ala Thr Ser 
385 390 395 400 



Gly Asp Arg Gin Arg Asp Ser Glu Val Asn Phe Lys Glu Leu Arg Thr 
405 410 415 



Ala Lys Met Lys Pro Glu Leu Leu Ser Gly His He Pro Pro Arg Pro 
420 425 ~ 430 



Ala Asn Phe Phe Val Phe Leu Val Glu Met Gly Phe His Arg Val Ser 
435 440 445 



Gin Asp Asp Leu Asp Leu Leu Thr Ser 
450 455 



<210> 50 

<211> 2280 

<212> DNA 

<213> NM 006424 SLC34A2 



<220> 



<221> CDS 
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<222> <64)..(2133> 
<223> 



<400> 50 

cgggccaggt ttccaggctc ggccgccgcc tccatcccag cacctgcgga gggagcgctg 60 

acc atg get ccc tgg cct gaa ttg gga gat gec cag ccc aac ccc gat 108 
Met Ala Pro Trp Pro' Glu Leu Gly Asp Ala Gin Pro Asn Pro Asp 
15 10 15 

aag tac etc gaa ggg gec gca ggt cag cag ccc act gec cct gat aaa 156 
Lys Tyr Leu Glu Gly Ala Ala Gly Gin Gin Pro Thr Ala Pro Asp Lys 
20 25 30 

age aaa gag acc aac aaa aca gat aac act gag gca cct gta acc aag 204 
Ser Lys Glu Thr Asn Lys Thr Asp Asn Thr Glu Ala Pro Val Thr Lys 
35 40 45 

att gaa ctt ctg ccg tec tac tec acg get aca. ctg ata gat gag ccc 252 
lie Glu Leu Leu Pro Ser Tyr Ser Thr Ala Thr Leu He Asp Glu Pro 
50 55 60 

act gag gtg gat gac ccc tgg aac eta ccc act ctt cag gac teg ggg 300 
Thr Glu Val Asp Asp Pro Trp Asn Leu Pro Thr Leu Gin Asp Ser Gly 
65 70 75 

ate aag tgg tea gag aga gac acc aaa ggg aag att etc tgt ttc ttc 348 
He Lys Trp Ser Glu Arg Asp Thr Lys Gly Lys He Leu Cys Phe Phe 
80 85 90 95 

caa ggg att ggg aga ttg att tta ctt etc gga ttt etc tac ttt ttc 396 
Gin Gly He Gly Arg Leu He Leu Leu Leu Gly Phe Leu Tyr Phe Phe 
100 105 " HO 

gtg tgc tec ctg gat att ctt agt age gec ttc cag ctg gtt gga gga 444 
Val Cys Ser Leu Asp He Leu Ser Ser Ala Phe Gin Leu Val Gly Gly 
115 120 125 

aaa atg gca gga cag ttc ttc age aac age tct att atg tec aac cct 492 
Lys Met Ala Gly Gin Phe Phe Ser Asn Ser Ser He Met Ser Asn Pro 
130 135 140 

ttg ttg ggg ctg gtg ate ggg gtg ctg gtg acc gtc ttg gtg cag age 540 
Leu Leu Gly Leu Val He Gly Val Leu Val Thr Val Leu Val Gin Ser 
145 150 155 

tec age acc tea acg tec ate gtt gtc age atg gtg tec tct tea ttg 588 
Ser Ser Thr Ser Thr Ser He Val Val Ser Met Val Ser Ser Ser Leu 
160 165 170 175 

etc act gtt egg get gec ate ccc att ate atg ggg gec aac att gga 636 
Leu Thr Val Arg Ala Ala He Pro He He Met Gly Ala Asn He Gly 
180 185 190 

acg tea ate acc aac act att gtt gcg etc atg cag gtg gga gat egg 684 
Thr Ser He Thr Asn Thr He Val Ala Leu Met Gin Val Gly Asp Arg 
195 200 205 

agt gag ttc aga aga get ttt gca gga gec act gtc cat gac ttc ttc 732 
Ser Glu Phe Arg Arg Ala Phe Ala Gly Ala Thr Val His Asp Phe Phe 
210 215 220 

aac tgg ctg tec gtg ttg gtg etc ttg ccc gtg gag gtg gec acc cat 780 
Asn Trp Leu Ser Val Leu Val Leu Leu Pro Val Glu Val Ala Thr His 
225 230 235 
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tac etc gag ate ata acc cag ctt ata gtg gag age ttc cac ttc aag 828 
Tyr Leu Glu lie lie Thr Gin Leu He Val Glu Ser Phe His Phe Lys 
240 245 250 . . 255 

aat gga gaa gat gee cca gat ctt c.tg aaa gtc ate act aag ccc ttc 
Asn Gly Glu Asp Ala Pro Asp Leu Leu Lys Val He Thr Lys Pro Phe 
260 265 270 

aca aag etc att gtc cag ctg gat aaa aaa gtt ate age caa att gca 
Thr Lys Leu He Val Gin Leu Asp Lys Lys Val He Ser Gin He Ala 
275 280 285 

atg aac gat gaa aaa gcg aaa aac aag agt ctt gtc aag att tgg tgc 
Met Asn Asp Glu Lys Ala Lys Asn Lys Ser Leu Val Lys He Trp Cys 
290 295 300 

aaa act ttt acc aac aag acc cag att aac gtc act gtt ccc teg act 1020 
Lys Thr Phe Thr Asn Lys Thr Gin He Asn Val Thr Val Pro Ser Thr 
305 310 315 



gtc aag ate ctg ggc tct gtg etc aag ggg cag gtc gee act gtc ate 
Val Lys He Leu Gly Ser Val Leu Lys Gly Gin Val Ala Thr Val He 
385 390 395 

aag aag acc ate aac act gat ttc ccc ttt ccc ttt gca tgg ttg act 
Lys Lys Thr He Asn Thr Asp Phe Pro Phe Pro Phe Ala Trp. Leu Thr 
400 405 410 415 



876 



924 



972 



get aac tgc acc tec cct tec etc tgt tgg acg gat ggc. ate caa aac 1068 
Ala Asn Cys Thr Ser Pro Ser Leu Cys Trp Thr Asp Gly He Gin Asn 
320 325 . 330 335 

tgg acc atg aag aat gtg acc tac aag gag aac ate gee aaa tgc cag 1116 
Trp Thr Met Lys Asn Val Thr Tyr Lys Glu Asn He Ala Lys Cys Gin 
340 345 350 

cat ate ttt gtg aat ttc cac etc ccg gat ctt get gtg ggc acc ate 1164 
His He Phe Val Asn Phe His Leu Pro Asp Leu Ala Val Gly Thr He 
355 360 365 

ttg etc ata etc tec ctg ctg gtc etc tgt ggt tgc ctg ate atg att 1212 
Leu Leu He Leu Ser Leu Leu Val Leu Cys Gly Cys Leu He Met He 
370 375 380 



1260 



1308 



ggc tac ctg gee ate etc gtc ggg gca ggc atg acc ttc ate gta cag 1356 
Gly Tyr Leu Ala He Leu Val Gly Ala Gly Met Thr Phe He Val Gin 
420 425 430 

age age tct gtg ttc acg teg gec ttg acc ccc ctg att gga ate ggc 1404 
Ser Ser Ser Val Phe Thr Ser Ala Leu Thr Pro Leu He Gly He Gly 
435 440 445 



gtg ata acc att gag agg get tat cca etc acg ctg ggc tec aac ate 
Val He Thr He Glu Arg Ala Tyr Pro Leu Thr Leu Gly Ser Asn He 
450 455 460 

ggc acc acc acc acc gee ate ctg gee gee tta gee age cct ggc aat 
Gly Thr Thr Thr Thr Ala He Leu Ala Ala Leu Ala Ser Pro Gly Asn 
465 470 475 

gca ttg agg agt tea etc cag ate gee ctg tgc cac ttt ttc ttc aac 
Ala Leu Arg Ser Ser Leu Gin He Ala Leu Cys His Phe Phe Phe Asn 
480 " 485 490 495 

ate tec ggc ate ttg ctg tgg tac ccg ate ccg ttc act cgc ctg ccc 
He Ser Gly He Leu Leu Trp Tyr Pro He Pro Phe Thr Arg Leu Pro 
500 505 510 



1452 



1500 



1548 



1596 
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ate cgc atg gec aag ggg ctg ggc aac ate tct gec aag tat cgc tgg 1644 
lie Arg Met Ala Lys Gly Leu Gly Asn lie Ser Ala Lys Tyr Arg Trp 
515 • 520 . 525 

ttc gec gtc ttc tac ctg ate a£c ttc ttc ttc ctg ate ccg ctg acg 1692 
Phe' Ala Val Phe Tyr Leu lie lie Phe .Phe Phe Leu lie Pro Leu Thr 
530 535 540 

gtg tttggc etc teg ctg gec ggc tgg egg gtg ctg gtt ggt gtc ggg 1740 
Val Phe Gly Leu Ser Leu Ala Gly Trp -Arg. Val Leu Val Gly Val Gly 
545 550 555 

gtt ccc gtc gtc ttc ate ate ate ctg gta ctg tgc etc cga etc ctg 1788 
Val Pro Val Val Phe He He He Leu Val Leu Cys Leu Arg Leu Leu 
560 565 570 . 575 

cag tct cgc tgc cca cgc gtc ctg ccg aag aaa etc cag aac tgg aac 1836 
Gin Ser Arg Cys Pro Arg Val Leu Pro Lys Lys Leu Gin Asn Trp Asn 
580 585 590 

ttc ctg ccg ctg tgg atg cgc teg. ctg aag ccc tgg gat gec gtc gtc 1884 
Phe Leu Pro Leu Trp Met Arg Ser Leu Lys Pro Trp Asp Ala Val Val 
595 600 605 

tec aag ttc ace ggc tgc ttc cag atg cgc tgc tgc tac tgc tgc cgc 1932 
Ser Lys Phe Thr Gly Cys Phe Gin Met Arg Cys Cys Tyr Cys Cys Arg 
610 615 620 

gtg tgc tgc cgc gcg tgc tgc ttg ctg tgt ggc tgc ccc aag tgc tgc 1980 
Val Cys Cys Arg Ala Cys Cys Leu Leu Cys Gly Cys Pro Lys Cys Cys 
625 630 • 635 

cgc tgc age aag tgc tgc gag gac ttg gag gag gcg cag gag ggg cag 2028 
Arg Cys Ser Lys Cys Cys Glu Asp Leu Glu Glu Ala Gin Glu Gly Gin 
640 645 650 655 

gat gtc cct gtc aag get cct gag ace ttt gat aac ata acc att age 2076 
Asp Val Pro Val Lys Ala Pro Glu Thr Phe Asp Asn He Thr He Ser 
660 665 670 

aga gag get cag ggt gag gtc cct gee teg gac tea aag acc gaa tgc 2124 
Arg Glu Ala Gin Gly Glu Val Pro Ala Ser Asp Ser Lys Thr Glu Cys 
675 680 685 

acg gee ttg taggggaege cccagattgt cagggatggg gggatggtcc 2173 
Thr Ala Leu 
690 

ttgagttttg catgctctcc tccctcccac ttctgcaccc tttcaccacc tcgaggagat 2233 
ttgctcccca ttagcgaatg aaattgatgc agtcctaaaa aaaaaaa 2280 



<210> 51 

<211> 690 

<212> PRT 

<213> m 006424 SLC34A2 



<400> 51 



Met Ala Pro Trp Pro Glu Leu Gly Asp Ala Gin Pro Asn Pro Asp Lys 
15 10 15 
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Tyr Leu Glu Gly Ala Ala Gly Gin Gin Pro Thr Ala Pro Asp Lys Ser 
20 25 30 



Lys Glu Thr Asn Lys Thr Asp Asn Thr Glu Ala Pro Val Thr Lys He 
35 40 45 



Glu Leu Leu Pro Ser Tyr Ser Thr Ala Thr Leu He Asp Glu Pro Thr 
50 55 60 



Glu Val Asp Asp Pro Trp Asn Leu Pro Thr Leu Gin Asp Ser Gly He 
65 70 75 80 



Lys Trp Ser Glu Arg Asp Thr Lys Gly Lys He Leu Cys Phe Phe Gin 
85 90 95 



Gly He Gly Arg Leu He Leu Leu Leu Gly Phe Leu Tyr Phe Phe Val 
100 105 110 



Cys Ser Leu Asp He Leu Ser Ser Ala Phe Gin Leu Val Gly Gly Lys 
115 120 125 



Met Ala Gly Gin Phe Phe Ser Asn Ser Ser He Met Ser Asn Pro Leu 
130 135 140 



Leu Gly Leu Val He Gly Val Leu Val Thr Val Leu Val Gin Ser Ser 
145 150 • 155 160 



Ser Thr Ser Thr Ser He Val Val Ser Met Val Ser Ser Ser Leu Leu 
. 165 170 175 



Thr Val Arg Ala Ala He Pro He He Met Gly Ala Asn He Gly Thr 
180 185 190 



Ser He Thr Asn Thr He Val Ala Leu Met Gin Val Gly Asp Arg Ser 
195 200 205 



Glu Phe Arg Arg Ala Phe Ala Gly Ala Thr Val His Asp Phe Phe Asn 
210 215 220 



Trp Leu Ser Val Leu Val Leu Leu Pro Val Glu Val Ala Thr His Tyr 
225 230 235 240 



Leu Glu He He Thr Gin Leu He Val Glu Ser Phe His Phe Lys Asn 
245 250 255 



Gly Glu Asp Ala Pro Asp Leu Leu Lys Val He Thr Lys Pro Phe Thr 
260 265 270 



Lys Leu He Val Gin Leu Asp Lys Lys Val He Ser Gin He Ala Met 



o o 

WO 2004/022778 PCT/AU2003/001166 

134 

275 280 285 

Asn. Asp Glu Lys Ala Lys Asn Lys Ser Leu Val Lys He Trp Cys Lys 
290 295 300 

Thr Phe Thr Asn Lys Thr Gin He Asn Val Thr Val Pro Ser Thr Ala 
305 310 315 320 

Asn Cys Thr Ser Pro Ser Leu Cys Trp Thr Asp Gly He Gin Asn Trp 
325 33Q 335 

Thr Met Lys Asn Val Thr Tyr Lys Glu Asn He Ala Lys Cys Gin His 
340 345 350 

He Phe Val Asn Phe His Leu Pro Asp Leu Ala Val Gly Thr He Leu 
355 360 365 

Leu He Leu Ser Leu Leu Val Leu Cys Gly Cys Leu He Met He Val 
370 375 380 

Lys He Leu Gly Ser Val Leu Lys Gly Gin Val Ala Thr Val He Lys 
385 390 395 400 

Lys Thr He Asn Thr Asp Phe Pro Phe. Pro Phe Ala Trp Leu Thr Gly 
405 410 415 

Tyr Leu Ala He Leu Val Gly Ala Gly Met Thr Phe He Val Gin Ser 
420 425 430 

' Ser Ser Val Phe Thr Ser Ala Leu Thr Pro Leu He Gly He Gly Val 
435 440 445 

He Thr He Glu Arg Ala Tyr Pro Leu Thr Leu Gly Ser Asn He Gly 
450 455 460 

Thr Thr Thr Thr Ala He Leu Ala Ala Leu Ala Ser Pro Gly Asn Ala 
465 470 475 480 

Leu Arg Ser Ser Leu Gin He Ala Leu Cys His Phe Phe Phe Asn He 
485 490 495 

Ser Gly He Leu Leu Trp Tyr Pro He Pro Phe Thr Arg Leu Pro He 
500 505 510 

Arg Met Ala Lys Gly Leu Gly Asn He Ser Ala Lys Tyr Arg Trp Phe 
515 520 525 

Ala Val Phe Tyr Leu He He Phe Phe Phe Leu He Pro Leu Thr Val 
530 535 540 



Phe Gly Leu Ser Leu Ala Gly Trp Arg Val Leu Val Gly Val Gly Val 
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545 550 555 560 



Pro Val Val Phe He He He Leu Val Leu Cys Leu Arg Leu Leu Gin 
565 570 575 



Ser Arg Cys Pro Arg Val Leu Pro Lys Lys Leu Gin Asn Trp Asn Phe 
580 585 590 



Leu Pro Leu Trp Met Arg Ser Leu Lys Pro Trp Asp Ala Val Val Ser 
595 600 605 



. Lys Phe Thr Gly Cys Phe Gin Met Arg Cys Cys Tyr Cys Cys Arg Val 
610 615 620 



Cys Cys Arg Ala Cys Cys Leu Leu Cys Gly Cys Pro Lys Cys Cys Arg 
625 630 635 640 



Cys Ser Lys Cys Cys Glu Asp Leu Glu Glu Ala- Gin Glu Gly Gin Asp 
645 650 655 



Val Pro Val Lys Ala Pro Glu Thr Phe Asp Asn He Thr He Ser Arg 
660 665 . 670 



Glu Ala Gin Gly Glu Val Pro Ala Ser Asp Ser Lys Thr Glu Cys Thr 
675 680 685 



Ala Leu 
690 



<210> 52 

<211> 529 

<212> DNA 

<213> AW959311, DKFZp434J037, EST 
<220> 

<221> misc_feature 

<222> (393).. (393) 

<223> the residue at position 393 is A, T C or G 



<400> 52 






ctgtttcttc 


aatggttctc ttcccttttc catcctccaa acctggcctg agcctcctga 


60 


agttgctgct 


gtgaatctga aagacttgaa aagcctccac ctgctgtgtg gacttcatct 


120 


caaggggccc 


agcctcctct ggactccacc ttggacctca gtgactcaga acttctgcct 


180 


ctaagctgct 


ctaaagtcca gactatggat gtgttctcta ggccttcagg actctagaat 


240 
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gtccatattt 


atttttatgt tcttggcttt gtgttttagg aaaagtgaat cttgctgttt 


300 


tcaataatgt 


gaatgctatg ttctgggaaa atccactatg acatctaagt tttgtgtaca 


360 


gagagatatt 


tttgcaacta tttccacctt ctnccacaac cccccacact ccactccaca 


420 


ctcttgagtc 


tctttaccta atggtctcta cctaatggac cctcgtggcc aaaaagtcca 


480 


ttaaaccaga 


aaggtgattg gaaaaaaaaa aaaaaaaact cgagggggg 


529 



<210> 53 

<211> 2100 

<212> DNA 

<213> AF111713 JAM1, junctional adhesion molecule 1 
<220> 

<221>. CDS 

<222> (287) . . (1183) 

<223> 



<400> 53 



cgaggccgag agggcgggcc agggccgcac tccggagact 


cgcggttgct 


acgcgcacca 


60 


tggctggaga cggagtttcg ctcttgatgc ccagcaggct 


ggagtgcgat 


ggcgcgattt 


120 


cggctcactg caacctcctc ctcccagagg tacttctcag 


ccctctagct 


ccaactgaga 


180 


acccagccag tcaggaagtc gctacttcgg gaacaccaac 


caatcagggg 


gccgtcacct 


240 


gctgaaggag tccttcggcg gctgttgtgt cgggagcctg 


atcgcg atg 


ggg aca 


295 




Met 
1 


Gly Thr 





aag gcg caa gtc gag agg aaa ctg ttg tgc etc ttc ata ttg gcg ate 343 
Lys Ala Gin Val Glu Arg Lys Leu Leu Cys Leu Phe He Leu Ala He 
5 10 15 



ctg ttg tgc tec ctg gca ttg ggc agt gtt aca gtg cac tct tct gaa 391 
Leu Leu Cys Ser Leu Ala Leu Gly Ser Val Thr Val His Ser Ser Glu 
20 25 30 35 

cct gaa gtc aga att cct gag aat aat cct gtg aag ttg tec tgt gee 439 
Pro Glu Val Arg He Pro Glu Asn Asn Pro Val Lys Leu Ser Cys Ala 
40 45 50 

tac teg ggc ttt tct tct ccc cgt gtg gag tgg aag ttt gac caa gga 487 
Tyr Ser Gly Phe Ser Ser Pro Arg Val Glu Trp Lys Phe Asp Gin Gly 
55 60 65 

gac ace ace aga etc gtt tgc tat aat aac aag ate aca get tec tat 535 
Asp Thr Thr Arg Leu Val Cys Tyr Asn Asn Lys He Thr Ala Ser Tyr 
70 75 80 

gag gac egg gtg ace ttc ttg cca act ggt ate acc ttc aag tec gtg 583 
Glu Asp Arg Val Thr Phe Leu Pro Thr Gly He Thr Phe Lys Ser Val 
85 90 95 
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aca egg gaa gac act ggg aca tac act tgt atg gtc tct gag gaa ggc 631 
Thr Arg Glu Asp Thr Gly Thr Tyr Thr Cys Met Val Ser Glu Glu Gly 
100 105 110 115 

ggc aac age tat ggg gag gtc aag gtc aag etc ate gtg ctt gtg cct 679 
Gly Asn Ser Tyr Gly Glu Val Lys Val Lys Leu lie Val Leu Val Pro 
120 125 130 

cca tec aag cct aca gtt aac ate ccc tec tct gec acc att ggg aac 727 
Pro Ser Lys Pro Thr Val Asn lie Pro Ser Ser Ala Thr lie Gly Asn 
135 140 145 

egg gca gtg ctg aca tgc tea gaa caa gat ggt tec cca cct tct gaa 775 
Arg Ala Val Leu Thr Cys Ser Glu Gin Asp Gly Ser Pro Pro Ser Glu 
150 155 160 

tac acc tgg ttc aaa gat ggg ata gtg atg cct acg aat ccc aaa age 823 
Tyr Thr Trp Phe Lys Asp Gly lie Val Met Pro Thr Asn Pro Lys Ser 
165 170 175 

acc cgt gee ttc age aac tct tec tat gtc ctg aat ccc aca aca gga 871 
Thr Arg Ala Phe Ser Asn Ser Ser Tyr Val Leu Asn Pro Thr Thr Gly 
180 185 190 195 

gag ctg gtc ttt gat ccc ctg tea gec tct gat act gga gaa tac age 919 
Glu Leu Val Phe Asp Pro Leu Ser Ala Ser Asp Thr Gly Glu Tyr Ser 
200 205 210 

tgt gag gca egg aat ggg tat ggg aca .ccc atg act tea aat get gtg 967 
Cys Glu Ala Arg Asn Gly .Tyr Gly Thr Pro Met Thr Ser Asn Ala Val 
215 220 225 

cgc atg gaa get gtg gag egg aat gtg ggg gtc ate gtg gca gec gtc 1015 
Arg Met Glu Ala Val Glu Arg Asn Val Gly Val He Val Ala Ala Val 
230 235 240 

ctt gta acc ctg att etc ctg gga ate ttg gtt ttt ggc ate tgg ttt 1063 
Leu Val Thr Leu He Leu Leu Gly He Leu Val Phe Gly Xle Trp Phe 
245 250 255- 

gec tat age cga ggc cac ttt gac aga aca aag aaa ggg act teg agt 1111 
Ala Tyr Ser Arg Gly His Phe Asp Arg Thr Lys Lys Gly Thr Ser Ser 
260 265 270 275 

aag aag gtg att tac age cag cct agt gec cga agt gaa gga gaa ttc 1159 
Lys Lys Val He Tyr Ser Gin Pro Ser Ala Arg Ser Glu Gly Glu Phe 





280 




285 




290 




aaa cag acc teg tea ttc ctg gtg tgagcctggt cggctcaccg cctatcatct 
Lys Gin Thr Ser Ser Phe Leu Val 
295 


1213 


geatttgect 


tactcaggtg 


ctactggact 


ctggcccctg 


atgtctgtag 


tttcacagga 


1273 


tgccttattt 


gtcttctaca 


ccccacaggg 


ccccctactt 


etteggatgt 


gtttttaata 


1333 


atgtcagcta 


tgtgccccat 


cctccttcat 


gccctccctc 


cctttcctac 


cactgetgag 


1393 


tggcctggaa 


cttgtttaaa 


gtgtttattc 


cccatttctt 


tgagggatca 


ggaaggaatc 


1453 


ctgggtatgc 


cattgacttc 


ccttctaagt 


agacagcaaa 


aatggcgggg 


gtcgcaggaa 


1513 


tctgcactca 


actgcccacc 


tggctggcag 


ggatctttga 


ataggtatct 


tgagcttggt 


1573 


tctgggctct 


ttccttgtgt 


actgacgacc 


agggecaget 


gttctagagt 


gggaattaga 


1633 


ggctagagcg 


gctgaaatgg 


ttgtttggtg 


atgacactgg 


ggtccttcca 


tctctggggc 


1693 
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ccactctctt 


ctgtcttccc atgggaagtg 


ccactgggat 


ccctctgccc tgtcctcctg 


1753 


aatacaagct 


gactgacatt gactgtgtct 


gtggaaaatg 


ggagctcttg ttgtggagag 


1813 


catagtaaat 


tttcagagaa cttgaagcga 


aaaggattta 


aaaccgctgc tctaaagaaa 


1873 


agaaaactgg 


aggctgggcg cagtggctca 


cgcctgtaat 


cccagaggct gaggcaggcg 


1933 


gatcacctga 


ggtcgggagt tcggg'atcag 


cctgaccaac 


atggagaaac cctgctggaa 


1993 


atacagagtt 


agccaggcat ggtggtgcat 


gcctgtagtc 


ccagctgctc aggagcctgg 


2053 


caacaagagc 


aaaactccag ctcaaaaaaa 


aaaaaaaaaa 


aaaaaaa 


2100 



<210> 54 

<211> 299 

<212> PRT 

<213> AF111713 JAM1, junctional adhesion molecule 1 

<400> 54 

Met Gly Thr Lys Ala Gin Val Glu Arg Lys Leu Leu Cys Leu Phe He 
1 5 10 15 



Leu Ala He Leu Leu Cys Ser Leu Ala Leu Gly Ser Val Thr Val His 
20 25 30 



Ser Ser Glu Pro Glu Val Arg He Pro Glu Asn Asn Pro Val Lys Leu 
35 40 45 



Ser Cys Ala Tyr Ser Gly Phe Ser Ser. Pro Arg Val Glu Trp Lys Phe 
50 55 60 



Asp Gin Gly Asp Thr Thr Arg Leu Val Cys Tyr Asn Asn Lys He Thr 
65 70 .75 80 



Ala Ser Tyr Glu Asp Arg Val Thr Phe Leu Pro Thr Gly He Thr Phe 
85 90 95 



Lys Ser Val Thr Arg Glu Asp Thr Gly Thr Tyr Thr Cys Met Val Ser 
100 105 110 



Glu Glu Gly Gly Asn Ser Tyr Gly Glu Val Lys Val Lys Leu He Val 
115 120 125 



Leu Val Pro Pro Ser Lys Pro Thr Val Asn He Pro Ser Ser Ala Thr 
130 135 140 



He Gly Asn Arg Ala Val Leu Thr Cys Ser Glu Gin Asp Gly Ser Pro 
145 150 155 * ~ 160 
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Pro Ser Glu Tyr Thr Trp Phe Lys Asp Gly lie Val Met Pro Thr Asn 
165 170 175 



Pro Lys Ser Thr Arg Ala Phe Ser Asn Ser Ser Tyr Val Leu Asn Pro 
180 185 190 



Thr Thr Gly Glu Leu Val Phe Asp Pro Leu Ser Ala Ser Asp Thr Gly 
195 200 205 



Glu Tyr Ser Cys Glu Ala Arg Asn Gly Tyr Gly Thr Pro Met Thr Ser 
210 215 220 



Asn Ala Val Arg Met Glu Ala Val Glu Arg Asn Val Gly Val He Val 
225 230 235 240 



Ala Ala Val Leu Val Thr Leu He Leu Leu Gly He Leu Val Phe Gly 
245 250 255 



He Trp Phe Ala Tyr Ser Arg Gly His Phe Asp Arg Thr Lys Lys Gly 
260 265 270 



Thr Ser Ser Lys Lys Val He Tyr Ser Gin Pro Ser Ala Arg Ser Glu 
275 280 285 



Gly Glu Phe Lys Gin Thr Ser Ser Phe Leu Val 
290 295 



<210> 55 

<211> 2154 

<212> DNA 

<213> NM_006636 MTHFD2, methylene tetrahydrof olate 
<220> 

<221> CDS 

<222> (77) (1108) 

<223> 



<400> 55 

atataaccgc gtggcccgcg cgcgcgcttc cctcccggcg cagtcaccgg cgcggtctat 60 

ggctgcgact tctcta atg tct get ttg get gee egg ctg ctg cag ccc gcg 112 
Met Ser Ala Leu Ala Ala Arg Leu Leu Gin Pro Ala 
1 5 10 

cac age tgc tec ctt cgc ctt cgc cct ttc cac etc gcg gca gtt cga 160 
His Ser Cys Ser Leu Arg Leu Arg Pro Phe His Leu Ala Ala Val Arg 
15 20 25 
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aat gaa get gtt gtc att tct gga agg aaa ctg gee cag cag ate aag 208 
Asn Glu Ala Val Val He Ser Gly Arg Lys Leu Ala .Gin Gin He Lys 
30 35 40- 

cag gaa gtg egg cag gag gta gaa gag tgg gtg gec tea ggc aac aaa 
Gin Glu Val Arg Gin Glu Val Glu Glu Trp Val Ala Ser Gly Asn Lys 
45 50 55 60 

egg cca cac ctg agt gtg ate ctg gtt ggc gag aat cct gca agt cac 
Arg Pro His Leu Ser Val He Leu Val Gly Glu Asn Pro Ala Ser His 
65 70 75 

tec tat gtc etc aac aaa ace agg gca get gca gtt gtg gga ate aac 
Ser Tyr Val Leu Asn Lys Thr Arg Ala Ala Ala Val Val Gly He Asn 
80 85 90 

agt gag aca att atg aaa cca get tea att tea gag' gaa gaa ttg ttg 
Ser Glu Thr He Met Lys Pro Ala Ser He Ser Glu Glu Glu Leu Leu 
95 100 . \ 105 

aat tta ate aat aaa ctg aat aat gat gat aat gta gat ggc etc ctt 448 
Asn Leu He Asn Lys Leu Asn Asn Asp Asp Asn Val Asp Gly Leu Leu 
110 115 120 



gtt cag ttg cct ctt cca gag cat att gat gag aga agg ate tgc aat 
Val Gin Leu Pro Leu Pro Glu His He Asp Glu Arg Arg He Cys Asn 
125 130 135 140 



act gtt aca ata tct cat cga tat act ccc aaa gag cag ttg aag aaa 
Thr Val Thr He Ser His Arg Tyr Thr Pro Lys Glu Gin Leu Lys Lys 
225 230 235 

cat aca att ctt gca gat att gta ata tct get gca ggt att cca aat 
His Thr He Leu Ala Asp He Val He Ser Ala Ala Gly He Pro Asn 
240 245 250 

ctg ate aca gca gat atg ate aag gaa gga gca gca gtc att gat gtg 
Leu He Thr Ala Asp Met He Lys Glu Gly Ala Ala Val He Asp Val 
255 * 260 265 



gga ata aat aga gtt cac gat cct gta act gec aaa ccc aag ttg gtt 
Gly He Asn Arg Val His Asp Pro Val Thr Ala Lys Pro Lys Leu Val 
270 275 280 



gga gat gtg gat ttt gaa gga gtc aga caa aaa get ggg tat ate act 
Gly Asp Val Asp Phe Glu Gly Val Arg Gin Lys Ala Gly Tyr He Thr 
285 290 . 295 300 



256 



304 



352 



400 



496 



get gtt tct cca gac aag gat gtt gat ggc ttt cat gta att aat gta 544 
Ala Val Ser Pro Asp Lys Asp Val Asp Gly Phe His Val He Asn Val 
145 150 155 

gga cga atg tgt ttg gat cag tat tec atg tta ccg get act cca tgg 
Gly Arg Met Cys Leu Asp Gin Tyr Ser Met Leu Pro Ala Thr Pro Trp 
160 165 - 170 



592 



688 



ggt gtg tgg gaa ata ate aag cga act ggc att cca ace eta ggg aag 640 
Gly Val Trp Glu He He Lys Arg Thr Gly He Pro Thr Leu Gly Lys 
175 180 185 

aat gtg gtt gtg get gga agg tea aaa aac gtt gga atg ccc att gca 
Asn Val Val Val Ala Gly Arg Ser Lys Asn Val Gly Met Pro He Ala 
190 195 200 

atg tta ctg cac aca gat ggg gcg cat gaa cgt ccc gga ggt gat gee 736 
Met- Leu Leu His Thr Asp Gly Ala His Glu Arg Pro Gly Gly Asp Ala 
205 210 215 220 



784 



832 



880 



928 



976 



Met Ser Ala Leu Ala Ala Arg Leu Leu Gin Pro Ala His Ser Cys Ser 
1 5 10 15 



Leu Arg Leu Arg Pro Phe His Leu Ala Ala Val Arg Asn Glu Ala Val 
20 25 30 
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cca gtt cct gga ggt gtt ggc ccc atg aca gtg gca- atg eta atg aag 1024 
Pro Val Pro Gly Gly Val Gly Pro Met Thr Val Ala Met Leu Met Lys 
305 310 315 

aat acc att att get gca aaa aag gtg ctg agg ctt gaa gag cga gaa 1072 
Asn Thr lie lie Ala Ala Lys Lys Val Leu Arg Leu Glu Glu Arg Glu 
320 325 330 

gtg ctg aag tct aaa gag ctt ggg gta gec act aat taactactgt 1118 
Val Leu Lys Ser Lys Glu Leu Gly Val Ala Thr Asn 
335 340 

gtcttctgtg tcacaaacag cactccaggc cagctcaaga agcaaagcag gecaatagaa 1178 

atgeaatatt tttaatttat tctactgaaa tggtttaaaa tgatgccttg tatttattga 1238 

aagcttaaat gggtgggtgt ttctgeacat acctctgcag tacctcacca gggagcattc 1298 

cagtatcatg cagggtcctg tgatctagee aggagcagee attaacctag tgattaatat 1358 

gggagacatt accatatgga ggatggatgc ttcactttgt caagcacctc agttacacat 1418 

tcgccttttc taggattgea tttcccaagt getattgeaa taacagttga tactcatttt 1478 

aggtaccaga ccttttgagt tcaactgatc aaaccaaagg aaaagtgttg ctagagaaaa 1538 

ttggggaaaa ggtgaaaaag aaaaaatggt agtaattgag cagaaaaaaa ttaatttata 1598 

tatgtattga ttggcaacca gatttatcta agtagaactg aattggctag gaaaaaagaa 1658 

aaactgcatg ttaatcattt tcctaagctg tccttttgag gcttagtcag tttattggga 1718 

aaatgtttag gattattcct tgctattagt actcatttta tgtatgttac ccttcagtaa 1778 

gttctcccca ttttagtttt ctaggactga aaggattctt ttctacatta tacatgtgtg 1838 J 

i 

ttgtcatatt tggcttttgc tatatacttt aacttcattg ttaaattttt gtattgtata 1898 

gtttctttgg tgtatcttaa aacctatttt tgaaaaacaa acttggcttg ataatcattt 1958 

gggcagcttg ggtaagtacg caacttactt ttccaccaaa gaactgtcag cagctgcctg 2018 

cttttctgtg atgtatgtat cctgttgact tttccagaaa ttttttaaga gtttgagtta 2078 

ctattgaatt taatcagact ttctgattaa agggttttct ttctttttta ataaaacaca 2138 

tctgtctggt atggta 2154 

<210> 56 \ 

<211> 344 ij 

1 

<212> PRT | 

] 

<213> NM_006636 MTHFD2, methylene tetrahydrofolate j 

i 
| 

<400> 56 i 
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Val lie Ser Gly Arg Lys Leu Ala Gin Gin He Lys Gin Glu Val Arg 
35 40 45 



Gin Glu Val Glu Glu Trp Val Ala Ser Gly Asn Lys Arg Pro His Leu 
50 55 60 



Ser Val He Leu Val Gly Glu Asn Pro Ala Ser His Ser Tyr Val Leu 
65 70 75 80 



Asn Lys Thr Arg Ala Ala Ala Val Val Gly lie . Asn Ser Glu Thr He 
85 90 95 



Met Lys Pro Ala Ser He Ser Glu Glu Glu Leu Leu Asn Leu He Asn 
100 105 110 



Lys Leu Asn Asn Asp Asp Asn Val Asp Gly Leu Leu Val Gin Leu Pro 
115 120 125 



Leu Pro Glu His He Asp Glu Arg Arg He Cys Asn Ala Val Ser Pro 
130 135 140 



Asp Lys Asp Val Asp Gly Phe His Val He Asn Val Gly Arg Met Cys 
145 150 155 160 



Leu Asp Gin Tyr Ser Met Leu Pro Ala Thr Pro Trp Gly Val Trp Glu 
165 170 175 



He He Lys Arg Thr Gly He Pro Thr Leu Gly Lys Asn Val Val Val 
180 185 190 



Ala Gly Arg Ser Lys Asn Val Gly Met Pro He Ala Met Leu Leu His 
195 200 205 



Thr Asp Gly Ala His Glu Arg Pro Gly Gly Asp Ala Thr Val Thr He 
210 215 220 



Ser His Arg Tyr Thr Pro Lys Glu Gin Leu Lys Lys His Thr He Leu 
225 230 235 240 



Ala Asp He Val He Ser Ala Ala Gly He Pro Asn Leu He Thr Ala 
245 250 255 



Asp Met He Lys Glu Gly Ala Ala Val He Asp Val Gly He Asn Arg 
260 265 270 



Val His Asp Pro Val Thr Ala Lys Pro Lys Leu Val Gly Asp Val Asp 
275 280 285 



Phe Glu Gly Val Arg Gin Lys Ala Gly Tyr He Thr Pro Val Pro Gly 
290 295 300 
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Gly Val Gly Pro Met Thr Val' Ala Met Leu Met Lys Asn Thr lie He 
305 310 315 320 

Ala Ala Lys Lys Val Leu Arg Leu Glu Glu Arg Glu Val Leu Lys Ser 
325 330 335 

Lys Glu Leu Gly Val Ala Thr Asn 
340 

<210> 57 
<211> 1117 
<212> DNA 

<213> NM_006149 galectin 4, LGALS4 



<220> 

<221> CDS 

<222> (57) . . (1025) 

<223> 



<400> 57 

atctcccact cctgcagctc ttctcacagg accagccact agcgcagcct cgagcg atg 59 

Met 
1 

gcc tat gtc ccc gca ccg ggc tac cag ccc acc tac aac ccg acg ctg 107 
Ala Tyr Val Pro Ala Pro Gly Tyr Gin Pro Thr Tyr Asn Pro Thr Leu 
5 10 15 

cct tac tac cag ccc ate ccg ggc ggg etc aac gtg gga atg tct gtt 155 
Pro Tyr Tyr Gin Pro He Pro Gly Gly Leu Asn Val Gly Met Ser Val 
20 25 30 

tac ate caa gga gtg gcc age gag cac atg aag egg ttc ttc gtg aac 203 
Tyr He Gin Gly Val Ala Ser Glu His Met Lys Arg Phe Phe Val Asn 
35 40 45 

ttt gtg gtt ggg cag gat ccg ggc tea gac gtc gcc ttc cac ttc aat 251 
Phe Val Val Gly Gin Asp Pro Gly Ser Asp Val Ala Phe His Phe Asn 
50 55 60 65 

ccg egg ttt gac ggc tgg gac aag gtg gtc ttc aac acg ttg cag ggc 299 
Pro Arg Phe Asp Gly Trp Asp Lys Val Val Phe Asn Thr Leu Gin Gly 
70 75 80 

ggg aag tgg ggc age gag gag agg aag agg age atg ccc ttc aaa aag 347 
Gly Lys Trp Gly Ser Glu Glu Arg Lys Arg Ser Met Pro Phe Lys Lys 
85 90 95 

ggt gcc gcc ttt gag ctg gtc ttc ata gtc. ctg get gag cac tac aag 395 
Gly Ala Ala Phe Glu Leu Val Phe He Val Leu Ala Glu His Tyr Lys 
100 105 110 

gtg gtg gta aat gga aat ccc ttc tat gag tac ggg cac egg ctt ccc 443 
Val Val Val Asn Gly Asn Pro Phe Tyr Glu Tyr Gly His Arg Leu Pro 
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115 120 125 

eta cag atg gtc acc cac ctg caa gtg gat ggg gat ctg caa ctt caa 491 
Leu Gin Met Val Thr His Leu Gin Val Asp Gly Asp Leu Gin Leu Gin 
130 135 140 145 

tea ate aac ttc ate gga ggc cag ccc etc egg ccc cag gga ccc ccg 539 
. Ser lie Asn Phe lie Gly Gly Gin Pro Leu Arg Pro Gin Giy Pro Pro 
150 155 160 

atg atg cca cct tac cct ggt ccc gga cat tgc cat caa cag ctg aac 587 
Met Met Pro Pro Tyr Pro Gly Pro Gly His Cys His Gin Gin Leu Asn 
165 170 175 

age ctg ccc acc atg gaa gga ccc cca acc ttc aac ccg cct gtg cca 635 
Ser Leu Pro Thr Met Glu Gly Pro Pro Thr Phe Asn Pro Pro Val Pro 
180 185 190 

tat ttc ggg agg ctg caa gga ggg etc aca get cga aga acc ate ate 683 
Tyr Phe Gly Arg Leu Gin Gly Gly Leu Thr Ala Arg Arg Thr He He 
195 . 200 205 

ate aag ggc tat gtg cct ccc aca ggc aag age ttt get ate aac ttc 731 
lie Lys Gly Tyr Val Pro Pro Thr Gly Lys Ser Phe Ala He Asn Phe 
210 215 220 225 

aag gtg ggc tec tea ggg gac ata get ctg cac att aat ccc cgc atg 779 
Lys Val Gly Ser Ser Gly Asp He Ala Leu His lie Asn Pro. Arg Met 
230 235 240 

ggc aac ggt acc gtg gtc egg aac age ctt ctg aat ggc teg tgg gga 827 
Gly Asn Gly Thr Val Val Arg Asn Ser Leu Leu Asn Gly Ser Trp Gly 
245 250 255 

tec gag gag aag aag ate acc cac aac cca ttt ggt ccc gga cag ttc 875 
Ser Glu Glu Lys Lys He Thr His Asn Pro Phe Gly Pro Gly Gin Phe 
260 265 .270 

ttt gat ctg tec att cgc tgt ggc ttg gat cgc ttc aag gtt tac gec 923 
Phe Asp Leu Ser He Arg Cys Gly Leu Asp Arg Phe Lys Val Tyr Ala 
275 280 285 

aat ggc cag cac etc ttt gac ttt gec cat cgc etc teg gec ttc cag 971 
Asn Gly <31n His Leu Phe Asp Phe Ala His Arg Leu Ser Ala Phe Gin 
290 295 300 305 

agg gtg gac aca ttg gaa ate cag ggt gat gtc acc ttg tec tat gtc 1019 
Arg Val Asp Thr Leu Glu He Gin Gly Asp Val Thr Leu Ser Tyr Val 
310 315 320 

cag ate taatctattc ctggggccat aactcatggg aaaacagaat tatcccctag 1075 
Gin He 

gactcctttc taagccccta ataaaatgtc tgagggtgtc tc 1117 

<210> 58 

<211> 323 

<212> PRT 

<213> NMJ)06149 galectin 4, LGALS4 



<400> 58 
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Met Ala Tyr Val Pro Ala Pro Gly Tyr Gin Pro Thr Tyr Asn Pro Thr 
15 10 15 



Leu Pro Tyr Tyr Gin Pro lie Pro Gly Gly Leu Asn Val Gly Met Ser 
20 25 30 



Val Tyr lie Gin Gly Val Ala Ser Glu His Met Lys Arg Phe Phe Val 
35 AO 45 



Asn Phe Val Val Gly Gin Asp Pro Gly Ser Asp Val -Ala Phe His Phe 
50 55 60 



Asn Pro Arg Phe Asp Gly Trp Asp Lys Val Val Phe Asn Thr Leu Gin 
65 ~ 70 75 80 



Gly Gly Lys Trp Gly Ser Glu Glu Arg Lys Arg Ser Met Pro Phe Lys 
85 90 95 



Lys Gly Ala Ala Phe Glu Leu Val Phe lie Val Leu Ala Glu His Tyr 
100 105 110 



Lys Val Val Val Asn Gly Asn Pro Phe Tyr Glu Tyr Gly His Arg Leu 
115 120 125 



Pro Leu Gin Met Val Thr His Leu Gin Val Asp Gly Asp Leu Gin Leu 
130 135 140 



Gin Ser He Asn Phe He Gly Gly Gin Pro Leu Arg Pro Gin Gly Pro 
145 150 155 160 



Pro Met Met Pro Pro Tyr Pro Gly Pro Gly His Cys His Gin Gin Leu 
165 170 175 



Asn Ser Leu Pro Thr Met Glu Gly Pro Pro Thr Phe Asn Pro Pro Val 
180 185 190 



Pro Tyr Phe Gly Arg Leu Gin Gly Gly Leu Thr Ala Arg Arg Thr He 
195 200 205 



.He He Lys Gly Tyr Val Pro Pro Thr Gly Lys Ser Phe Ala He Asn 
210 215 220 



Phe Lys Val Gly Ser Ser Gly Asp He Ala Leu Hi's He Asn Pro Arg 
225 230 235 240 



Met Gly Asn Gly Thr Val Val Arg Asn Ser Leu Leu Asn Gly Ser Trp 
245 250 255 



Gly Ser Glu Glu Lys Lys He Thr His Asn Pro Phe Gly Pro Gly Gin 
260 265 270 
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Phe Phe Asp Leu Ser lie Arg Cys Gly Leu Asp Arg Phe Lys Val Tyr 
275 280 ' 285 



Ala Asn Gly Gin His Leu Phe Asp Phe Ala His Arg Leu Ser Ala Phe 
290 295 300 



Gin Arg Val Asp Thr Leu Glu He Gin Gly Asp Val Thr Leu Ser Tyr 
305 310 315 320 



Val Gin He 



<210> 59 

<211> 3697 

<212> DNA 

<213> NM_004063 cadherin 17, CDH17 



<220> 

<221> CDS 

<222> (121) (2616) 

<223> 



<400> 59 

agggagtgtt cccgggggag atactccagt cgtagcaaga gtctcgacca ctgaatggaa 60 

gaaaaggact tttaaccacc attttgtgac ttacagaaag gaatttgaat aaagaaaact 120 

atg ata ctt cag gcc cat ctt cac tec ctg tgt ctt ctt atg ctt tat 168 
Met He Leu Gin Ala His Leu His Ser Leu Cys Leu Leu Met Leu Tyr 
15 10 15 

ttg gca act gga tat ggc caa gag ggg aag ttt agt gga ccc ctg aaa 216 
Leu Ala Thr Gly Tyr Gly Gin Glu Gly Lys Phe Ser Gly Pro Leu Lys 
20 25 30 

ccc atg aca ttt tct att tat gaa ggc caa gaa ccg agt caa att ata 264 
Pro Met Thr Phe Ser He Tyr Glu Gly Gin Glu Pro Ser Gin He He 
35 40 45 

ttc cag ttt aag gcc aat cct cct get gtg act ttt gaa eta act ggg 312 
Phe Gin Phe Lys Ala Asn Pro Pro Ala Val Thr Phe Glu Leu Thr Gly 
50 55 60 

gag aca gac aac ata ttt gtg ata gaa egg gag gga ctt ctg tat tac 360 
Glu Thr Asp Asn He Phe Val He Glu Arg Glu Gly Leu Leu Tyr Tyr 
65 70 75 80 

aac aga gcc ttg gac agg gaa aca aga tct act cac aat etc cag gtt 408 
Asn Arg Ala Leu Asp Arg Glu Thr Arg Ser Thr His Asn Leu Gin Val 
85 90 95 

gca gcc ctg gac get aat gga att ata gtg gag ggt cca gtc cct ate 456 
Ala Ala Leu Asp Ala Asn Gly He He Val Glu Gly Pro Val Pro He 
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100 105 110 

acc ata gaa gtg aag gac ate aac gac aat cga ccc acg ttt etc cag 504 
Thr lie Glu Val Lys Asp He Asn Asp Asn Arg Pro Thr Phe Leu Gin 
115 120 125 

tea aag tac gaa ggc tea gta agg cag aac tct cgc cca gga aag ccc 552 
Ser Lys Tyr Glu Gly Ser Val Arg Gin Asn Ser Arg Pro Gly Lys Pro 
130 135 140 

ttc ttg tat gtc aat gec aca gac ctg gat gat ccg gec act ccc aat 600 
Phe Leu Tyr Val Asn Ala Thr Asp Leu Asp Asp Pro Ala Thr Pro Asn. 
145 150 155 160 

ggc cag ctt tat tac cag att gtc ate cag ctt ccc atg ate aac aat 648 
Gly Gin Leu Tyr Tyr Gin He Val He Gin Leu Pro Met He Asn Asn 
165 170 175 

gtc atg tac ttt cag ate aac aac aaa acg gga gec ate tct ctt acc 696 
Val Met Tyr Phe Gin He Asn Asn Lys Thr Gly Ala He Ser Leu Thr 
180 185 ' 190 

cga gag gga tct cag gaa ttg aat cct get aag aat cct tec tat aat 744 
Arg Glu Gly Ser Gin Glu Leu Asn Pro Ala Lys Asn Pro Ser Tyr Asn 
195 200 205 

ctg gtg ate tea gtg aag gac atg gga ggc cag agt gag aat tec ttc 792 
Leu Val He Ser Val Lys Asp Met Gly Gly Gin Ser Glu Asn Ser Phe 
210 215 220 

agt gat acc aca tct gtg gat ate ata gtg aca gag aat att tgg aaa 840 
Ser Asp Thr Thr Ser Val Asp He He Val Thr Glu Asn He Trp Lys 
225 230 235 240 

gca cca aaa cct gtg gag atg gtg gaa aac tea act gat cct cac ccc 888 
Ala Pro Lys Pro Val Glu Met Val Glu Asn Ser Thr Asp Pro His Pro 
245 250 255 

ate aaa ate act cag gtg egg tgg aat gat ccc ggt gca caa tat tec 936 
He Lys He Thr Gin Val Arg Trp Asn Asp Pro Gly Ala Gin Tyr Ser 
260 265 270 

. tta gtt gac aaa gag aag ctg cca aga ttc cca ttt tea att gac cag • 984 
Leu Val Asp Lys Glu Lys Leu Pro Arg Phe Pro Phe Ser He Asp Gin 
275 280 285 

gaa gga gat att tac gtg act cag ccc ttg gac cga gaa gaa aag gat 1032 
Glu Gly Asp He Tyr Val Thr Gin Pro Leu Asp Arg Glu Glu Lys Asp 
290 295 300 

gca tat gtt ttt tat gca gtt gca aag gat gag tac gga aaa cca ctt 1080 
Ala Tyr Val Phe Tyr Ala Val Ala Lys Asp Glu Tyr Gly Lys Pro Leu 
305 310 ' 315 320 

tea tat ccg ctg gaa att cat gta aaa gtt aaa gat att aat gat aat 1128 
Ser Tyr Pro Leu Glu He His Val Lys Val Lys Asp He Asn Asp Asn 
325 330 335 

cca cct aca tgt ccg tea cca gta acc gta ttt gag gtc cag gag aat 1176 
Pro Pro Thr Cys Pro Ser Pro Val Thr Val Phe Glu Val Gin Glu Asn 
340 345 350 

gaa cga ctg ggt aac agt ate ggg acc ctt act gca cat gac agg gat 1224 
Glu Arg Leu Gly Asn Ser He Gly Thr Leu Thr Ala His Asp Arg Asp 
355 360 365 



gaa gaa aat act gee aac agt ttt eta aac tac agg att gtg gag caa 
Glu Glu Asn Thr Ala Asn Ser Phe Leu Asn Tyr Arg He Val Glu Gin 
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370 375 380 

act ccc aaa ctt ccc atg gat gga etc ttc eta ate caa acc tat get 
Thr Pro Lys Leu Pro Met Asp Gly Leu Phe Leu lie Gin Thr Tyr Ala 
385 390 395 400 



1320 



gga atg tta cag tta get aaa cag tec ttg aag aag caa gat act cct 1368 
Gly Met Leu Gin Leu Ala Lys Gin Ser Leu Lys Lys Gin Asp Thr Pro 
405 410 415 

cag tac aac tta acg ata gag gtg tct gac aaa gat ttc aag acc ctt 1416 
Gin Tyr Asn Leu Thr lie Glu Val Ser Asp Lys Asp Phe Lys Thr Leu 
420 425' * 430 

tgt ttt gtg caa ate aac gtt att gat ate aat gat cag ate ccc ate 14 64 
Cys Phe Val Gin He Asn Val He Asp He Asn Asp Gin He Pro He 
435 440 445 

ttt gaa aaa tea gat tat gga aac ctg act ctt get gaa gac aca aac 1512 
Phe Glu Lys Ser Asp Tyr Gly Asn Leu Thr Leu Ala Glu Asp Thr Asn 
450 455 460 

att ggg tec acc ate tta acc ate cag gec act gat get gat gag cca 1560 
He Gly Ser Thr He Leu Thr He Gin Ala Thr Asp Ala Asp Glu Pro 
465 470 475 480 

ttt act ggg agt tct aaa att ctg tat cat ate ata aag gga gac- agt 1608 
Phe Thr Gly Ser Ser Lys He Leu Tyr His He He Lys Gly Asp Ser 
485 490 495 

gag gga cgc ctg ggg gtt gac aca gat ccc cat acc aac acc gga tat 1656 
Glu Gly Arg Leu Gly Val Asp Thr Asp Pro His Thr Asn Thr Gly Tyr 
500 505 510 

gtc ata att aaa aag cct ctt gat ttt gaa aca gca get gtt tec aac 1704 
Val He He Lys Lys Pro Leu Asp Phe Glu Thr. Ala Ala Val Ser Asn 
515 520 525 

att gtg ttc aaa gca gaa aat cct gag cct eta gtg ttt ggt gtg aag 1752 
lie Val Phe Lys Ala Glu Asn Pro Glu Pro Leu Val Phe Gly Val Lys 
530 535 540 

tac aat gca agt tct ttt gec aag ttc acg ctt att gtg aca gat gtg 1800 
Tyr Asn Ala Ser Ser Phe Ala Lys Phe Thr Leu He Val Thr Asp Val 
545 550 555 560 

aat gaa gca cct caa ttt tec caa cac gta ttc caa gcg aaa gtc agt 1848 
Asn Glu Ala Pro Gin Phe Ser Gin His Val Phe Gin Ala Lys Val Ser 
565 570 575 

gag gat gta get ata ggc act aaa gtg ggcaat gtg act gee aag gat 1896 
Glu Asp Val Ala He Gly Thr Lys Val Gly Asn Val Thr Ala Lys Asp 
580 585 590 

cca gaa ggt ctg gac ata age tat tea ctg agg. gga gac aca aga ggt 1944 
Pro Glu Gly Leu Asp He Ser Tyr Ser Leu Arg Gly Asp Thr Arg Gly 
595 600 605 

tgg ctt aaa att gac cac gtg act ggt gag ate ttt agt gtg get cca 1992 
Trp Leu Lys He Asp His Val Thr Gly -Glu lie Phe Ser Val Ala Pro 
610 615 620 

ttg gac aga gaa gec gga agt cca tat egg gta caa gtg gtg gec aca 2040 
Leu Asp Arg Glu Ala Gly Ser Pro Tyr Arg Val Gin Val Val Ala Thr 
625 630 635 640 

gaa gta ggg ggg tct tec ttg age tct gtg tea gag ttc cac ctg ate 2088 
Glu Val Gly Gly Ser Ser Leu Ser Ser Val Ser Glu Phe His Leu lie 
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645 



650 



655 



ctt atg gat gtg aat gac aac cct ccc agg eta 
Leu Met Asp Val Asn Asp Asn Pro Pro Arg Leu 
660 665 



gec aag gac tac acg 
Ala Lys Asp Tyr Thr 
670 



2136 



ggc ttg ttc ttc tgc cat ccc etc agt gca cct 
Gly Leu Phe Phe 'Cys His Pro Leu Ser Ala Pro 
675 680 



gga agt etc att ttc 
Gly Ser Leu lie Phe 
685 



2184 



gag get act gat gat gat cag cac tta ttt egg 
Glu Ala Thr Asp Asp Asp Gin His Leu Phe Arg 
690 695 



ggt ccc cat ttt aca 
Gly Pro His Phe Thr 
700 



2232 



ttt tec etc ggc agt gga age tta caa aac gac 
Phe Ser Leu Gly Ser Gly Ser Leu Gin Asn Asp 
705 710 715 



tgg gaa gtt tec aaa 
Trp Glu Val Ser Lys 
720 



2280 



ate aat ggt act cat gee cga ctg tct acc agg 
lie Asn Gly Thr His Ala Arg Leu Ser Thr Arg 
725 730 



cac aca gag ttt gag 
His Thr Glu Phe Glu 
735 



2328 



gag agg gag tat gtc gtc ttg ate egcate aat 
Glu Arg Glu Tyr Val Val Leu lie Arg lie Asn 
740 745 



gat ggg ggt egg cca 
Asp Gly Gly Arg Pro 
750 



2376 



ccc ttg gaa ggc att gtt tct tta cca gtt aca 
Pro Leu Glu Gly He Val Ser Leu Pro Val Thr 
755 760 



ttc tgc agt tgt gtg 
Phe Cys Ser Cys Val 
765 



2424 



gaa gga agt tgt ttc egg cca gca ggt cac cag 
Glu Gly Ser Cys Phe Arg Pro Ala Gly His Gin 
770 775 



act ggg ata ccc act 
Thr Gly He Pro Thr 
780 



2472 



gtg ggc atg gca gtt ggt ata ctg ctg acc acc 
Val Gly Met Ala Val Gly He Leu Leu Thr Thr 
785 790 795 



ctt ctg gtg att ggt 
Leu Leu Val He' Gly 
800 



2520 



ata att tta gca gtt gtg ttt ate cgc ata aag 
He He Leu Ala Val Val Phe He Arg He Lys 
805 810 



aag gat aaa ggc aaa 
Lys Asp Lys Gly Lys 
815 



2568 



gat aat gtt gaa agt get caa gca tct gaa gtc 
Asp Asn Val Glu Ser Ala Gin Ala Ser Glu Val 
820 825 



aaa cct ctg aga age 
Lys Pro Leu Arg Ser 
830 



2616 



tgaatttgaa 


aaggaatgtt 


tgaatttata 


tagcaagtgc 


tatttcagca 


acaaccatct 


2676 


catcctatta 


cttttcatct 


aacgtgcatt 


ataatttttt 


aaacagatat 


tccctcttgt 


2736 


cctttaatat 


ttgetaaata 


tttctttttt 


gaggtggagt 


ettgetctgt 


cgcccaggct 


2796 


ggagtacagt 


ggtgtgatcc 


cagctcactg 


caacctccgc 


ctcctgggtt 


cacatgattc 


2856 


tcctgcctca 


gcttcctaag 


tagctgggtt 


tacaggcacc 


caccaccatg 


cccagctaat 


2916 


ttttgtattt 


ttaatagaga 


eggggttteg 


ccatttggcc 


aggctggtct 


tgaactcctg 


2976 


aegtcaagtg 


atctgcctgc 


cttggtctcc 


caatacaggc 


atgaaccact 


gcacccacct 


3036 


acttagatat 


ttcatgtgct 


atagacatta 


gagagatttt 


tcatttttcc 


atgacatttt 


3096 


tcctctctgc 


aaatggctta 


gctacttgtg 


tttttccctt 


ttggggcaag 


acagactcat 


3156 


taaatattct 


gtacattttt 


tctttatcaa 


ggagatatat 


cagtgttgtc 


tcatagaact 


3216 


gectggatte 


catttatgtt 


ttttctgatt 


ccatcctgtg 


tccccttcat 


ccttgactcc 


3276 
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tttaatattt 


cactaaafctt 


caaacatttg 


tcacraaaacra aaaaccrtcraa 

i ¥ „ 


J. 

gactcaggaa 


3336 




at*aaaacfaac 


aacctttfccc 


v» v- ci y w Q L» L- d- o utio. ^ V7 


fctctatcrtca 


3396 


ttaaccatct 


ttaatcaatg 


tgacatgttg 


ctctttggct gaaattcttc 


aacttggaaa 


3456 


tgacacagac 


ccacagaagg 


tgttcaaaca 


caacctactc tgcaaacctt 


ggtaaaggaa 


3516 


ccagtcagct 


ggccagattt 


cctcactacc 


tgccatgcat acatgctgcg 


catgttttct 


3576 


tcattcgtat 


gttagtaaag 


ttttggttat 


tatatattta acatgtggaa 


gaaaacaaga 


3636 


catgaaaaga 


gtggtgacaa 


atcaagaata 


aacactggtt gtagtcagtt 


ttgtttgtta 


3696 


a 










3G97 



<210> 60 

<211> 832 

<212> PRT 

<213> NMJ)04063 cadherin 17, C0H17 



<400> 60 

Met lie Leu Gin Ala His Leu His Ser Leu Cys Leu Leu Met Leu Tyr 
15 10 15 



Leu Ala Thr Gly Tyr Gly Gin Glu Gly Lys Phe Ser Gly Pro Leu Lys 
20 25 30 



Pro Met Thr Phe Ser lie Tyr Glu Gly Gin Glu Pro Ser Gin lie lie 
35 40 45 



Phe Gin Phe Lys Ala Asn Pro Pro Ala Val Thr Phe Glu Leu Thr Gly 
50 55 60 



Glu Thr Asp Asn lie Phe Val lie Glu Arg Glu Gly Leu Leu Tyr Tyr 
65 70 75 • 80 



Asn Arg Ala Leu Asp Arg Glu Thr Arg Ser Thr His Asn Leu Gin Val 
85 90 .95 



Ala Ala Leu Asp Ala Asn Gly lie lie Val Glu Gly -Pro Val Pro lie 
100 105 110 



Thr He Glu Val Lys Asp He Asn Asp Asn Arg Pro Thr Phe Leu Gin 
115 120 125 



Ser Lys Tyr Glu Gly Ser Val Arg Gin Asn Ser Arg Pro Gly Lys Pro 
130 135 140 



Phe Leu Tyr Val Asn Ala Thr Asp Leu Asp Asp Pro Ala Thr Pro Asn 
145 150 155 160 
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Gly Gin Leu Tyr Tyr Gin lie Val He Gin Leu Pro Met He Asn Asn 
165 170 175 



Val Met Tyr Phe Gin He Asn Asn Lys Thr Gly Ala He Ser Leu Thr 
180 185 190 



Arg Glu Gly Ser Gin Glu Leu Asn Pro Ala Lys Asn Pro Ser Tyr Asn 
195 200 205 



Leu Val He Ser Val Lys Asp Met Gly Gly Gin Ser Glu Asn Ser Phe 
210 215 220 



Ser Asp Thr Thr Ser Val Asp He He Val Thr Glu Asn lie Trp Lys 
225 230 235 240 



Ala Pro Lys Pro Val Glu Met Val Glu Asn Ser Thr Asp Pro His Pro 
245 250 255 

He Lys He Thr Gin Val Arg Trp Asn Asp Pro Gly Ala Gin Tyr Ser 
260 265 270 

Leu Val Asp Lys Glu Lys Leu Pro Arg Phe Pro Phe Ser He Asp Gin 
275 280 285 



Glu Gly Asp He Tyr Val Thr Gin Pro Leu Asp Arg Glu Glu Lys Asp 
290 295 300 



Ala Tyr Val Phe Tyr Ala Val Ala Lys Asp Glu Tyr Gly Lys Pro Leu 
305 310 315 * ^ 320 



Ser Tyr Pro Leu Glu He His Val Lys Val Lys Asp He Asn Asp Asn 
325 330 335 



Pro Pro Thr Cys Pro Ser Pro Val Thr Val Phe Glu Val Gin Glu Asn 
340 345 350 



Glu Arg Leu Gly Asn Ser He Gly Thr . Leu Thr Ala His Asp Arg Asp 
355 360 365 



Glu Glu Asn Thr Ala Asn Ser Phe Leu Asn Tyr Arg He Val Glu Gin 
370 375 380 



Thr Pro Lys Leu Pro Met Asp Gly Leu Phe Leu He Gin Thr Tyr Ala 
385 390 395 400 



Gly Met Leu Gin Leu Ala Lys Gin Ser Leu Lys Lys Gin Asp Thr Pro 
405 410 415 



Gin Tyr Asn Leu Thr He Glu Val Ser Asp Lys Asp Phe Lys Thr Leu 
420 425 430 
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Cys Phe Val Gin He Asn Val: He Asp He Asn Asp Gin He Pro He 
435 440 •• 445 



Phe Glu Lys Ser Asp Tyr Gly Asn Leu Thr Leu Ala Glu Asp Thr Asn 
450 455 460 

He Gly Ser Thr He Leu Thr He Gin Ala Thr Asp Ala Asp Glu Pro 
465 470 - 475 480 

Phe Thr Gly Ser Ser Lys He Leu Tyr Bis He He Lys Gly Asp Ser 
485 490 495 

Glu Gly Arg Leu Gly Val Asp Thr Asp Pro His Thr Asn Thr Gly Tyr 
500 505 510 

Val He He Lys Lys Pro Leu Asp Phe Glu Thr Ala Ala Val Ser Asn 
515 520 525 

He Val Phe Lys Ala Glu Asn Pro Glu.Pro Leu Val Phe Gly Val Lvs 
530 535 540 

Tyr Asn Ala Ser Ser Phe Aia Lys Phe Thr Leu He Val Thr Asp Val 
545 550 555 560 

Asn Glu Ala Pro Gin Phe Ser Gin His Val Phe Gin Ala Lys Val Ser 
565 570 575 

Glu Asp Val Ala He Gly Thr Lys Val Gly Asn Val Thr Ala Lys Asp 
580 585- 590 

Pro Glu Gly Leu Asp He Ser Tyr Ser Leu Arg Gly Asp Thr Arg Gly 
595 600 605 

Trp Leu Lys He Asp His Val Thr Gly Glu He Phe Ser Val Ala Pro 
610 615 620 

Leu Asp Arg Glu Ala Gly Ser Pro Tyr Arg Val Gin Val Val Ala Thr 
625 630 635 640 

Glu Val Gly Gly Ser Ser Leu Ser Ser Val Ser Glu Phe His Leu He 
645 650 655 

Leu Met Asp Val Asn Asp Asn Pro Pro Arg Leu Ala Lys Asp Tyr Thr 
660 665 . 670 

Gly Leu Phe Phe Cys His Pro Leu Ser Ala Pro Gly Ser Leu He Phe 
675 680 685 



Glu Ala Thr Asp Asp Asp Gin His Leu Phe Arg Gly Pro His Phe Thr 
690 695 700 
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Phe Ser Leu Gly Ser Gly Ser Leu Gin Asn Asp Trp Glu Val Ser Lys 
705 710 715 720 



He Asn Gly Thr His Ala Arg Leu Ser Thr Arg- His Thr Glu Phe Glu 
725 730 735 

Glu Arg Glu Tyr Val Val Leu lie Arg He Asn Asp . Gly Gly Arg Pro 
740 745 - 750 

Pro Leu Glu Gly He Val Ser Leu Pro Val Thr Phe Cys Ser Cys Val 
755 ' 760 765 



Glu Gly Ser Cys Phe Arg Pro Ala Gly His Gin Thr Gly He Pro Thr 
770 775 780 



Val Gly Met Ala Val Gly He Leu Leu Thr Thr Leu Leu Val lie Gly 
785 790 795 . 800 



He He Leu Ala Val Val Phe He Arg lie Lys Lys Asp Lys Gly Lys 
805 810 815 



Asp Asn Val Glu Ser Ala Gin Ala Ser Glu Val Lys Pro Leu Arg Ser 
820 825 830 

<210> 61 
<211> 2920 
<212> DNA 

<213> NM_005588; meprin A, alpha 



<220> 

<221> CDS 

<222> (10) . . (2247) 

<223> 



<400> 61 

cttgcagca atg get tgg att aga tec act tgc att etc ttt ttt acc ttg 51 
Met Ala Trp He Arg Ser Thr Cys lie Leu Phe Phe Thr Leu 
15 m 



ctt ttt gee cac ata gca get gta ccg 
Leu Phe Ala His He Ala Ala Val Pro 
15 20 

aat gta cat gat gca gat ttt ggt gaa 
Asn Val His Asp Ala Asp Phe Gly Glu 
35 

aat tta get gca ggc ttg gac etc ttt 
Asn Leu Ala Ala Gly Leu Asp Leu Phe 



att aag cat ctt cct gaa gaa 99 
He Lys His Leu Pro Glu Glu 
25 30 

cag aag gat att tea gaa ate 147 
Gin Lys Asp He Ser Glu lie 
40 45 • 

caa ggg gac ate etc ttg cag 195 
Gin Gly Asp He Leu Leu Gin 
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50 55 60 

aaa tec aga aat ggc ctg aga gac cca aac acc agg tgg acg ttc ccc 243 
Lys Ser Arg Asn Gly Leu Arg Asp Pro Asn Thr Arg Trp Thr Phe Pro 
65 70 75 



att cct tac ate ttg get gat aat ttg ggg ctg aat get aaa gga gee 
lie Pro Tyr lie Leu Ala Asp Asn Leu Gly Leu Asn Ala Lys Gly Ala 
80 85 90 



ttt gac acc tat gat gat age tta ate aca gac etc aat aca ccc tat 
Phe Asp Thr Tyr Asp Asp Ser Leu He Thr Asp Leu Asn Thr Pro Tyr 
195 200 205 



291 



att ctg tat gee ttt gag atg ttc cgt etc aag tec tgt gtg gat ttc 339 
He Leu Tyr Ala Phe Glu Met Phe Arg Leu Lys Ser Cys Val Asp Phe 
95 100 105 no 

aag ccc tat gaa gga gag age tea tat ate ata .ttt caa cag ttt gat 387 
Lys Pro. Tyr Glu Gly Glu Ser Ser Tyr He He Phe Gin Gin Phe Asp 
115 .120 125 

ggg tgc tgg tct gag gtt ggt gac caa cat gtg gga cag aac att tec 435 
Gly Cys Trp Ser Glu Val Gly Asp Gin His Val Gly Gin Asn He Ser 
130 135 140 

att ggc caa gga tgt gec tat aag gee ate ata gaa cac gag ate ctg 483 
He Gly Gin Gly Cys Ala Tyr Lys Ala He He Glu His Glu He Leu 
145 .150 155 

cat get ttg gga ttt tac cac gag cag tea agg acg gac egg gat gat 531 
His Ala Leu Gly Phe Tyr His Glu Gin Ser Arg Thr Asp Arg Asp Asp 
160 165 170 

tat gtg aac ate tgg tgg gac caa att ctt tea ggt tac cag cac aac 579 
Tyr Val Asn He Trp Trp Asp Gin He Leu Ser Gly Tyr Gin His Asn 
175 180 185 190 



627 



gat tat gag tct ttg atg cac tac cag cct ttc tea ttt aac aag aat 675 
Asp Tyr Glu Ser Leu Met His Tyr Gin Pro Phe Ser Phe Asn Lys Asn 
210 215 . 220 

gca agt gtt ccc acc ate aca gee aag ate cct gag ttt aac tec att 723 
Ala Ser Val Pro Thr He Thr Ala Lys. He Pro Glu Phe Asn Ser He 
225 230 235 

ate gga caa cgc ctg gat ttc agt gee att gat tta gag agg ctg aac 771 
He Gly Gin Arg Leu Asp Phe Ser Ala He Asp Leu Glu Arg Leu Asn 
240 245 . 250 

cga atg tac aat tgc acc aca act cac act ctt ttg gac cac tgt act 819 
Arg Met Tyr Asn Cys Thr Thr Thr His Thr Leu Leu Asp His Cys Thr 
255 260 265 270 

ttt gag aag gca aac ate tgt gga atg att cag ggc acc aga gat gac 867 
Phe Glu Lys Ala Asn He Cys Gly Met He Gin Gly Thr Arg Asp Asp 
275 280 285 

act gac tgg gee cat cag gac agt get cag get gga gaa gtg gat cac 915 
Thr Asp Trp Ala His Gin Asp Ser Ala Gin Ala Gly Glu Val Asp His 
290 295 300 

acc ttg ttg gga caa tgc aca ggt gee ggc tac ttc atg cag ttc age 963 
Thr Leu Leu Gly Gin Cys Thr Gly Ala Gly Tyr Phe Met Gin Phe Ser 
305 310 315 

acc age teg ggg tec gcg gaa gag gca gee eta ctg gag tct egg att 1011 
Thr Ser Ser Gly Ser Ala Glu Glu Ala Ala Leu Leu Glu Ser Arg He 
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320 325 330 

ctt tac cca aag agg aag cag cag tgc ctg caa ttt ttc tat aaa atg 1059 
Leu Tyr Pro Lys Arg Lys Gin Gin Cys Leu Gin Phe Phe Tyr Lys Met 
335 340 345 350 

acg gga agt cct tea gac aga etc gtt gtc tgg gtc agg agg gat gac 1107 
Thr Gly Ser Pro Ser Asp Arg Leu Val Val Trp Val Arg Arg Asp Asp 
355 360 365 

age aca ggc aat gtt cgc aag ttg gtg aag gtg cag act ttt caa gga 1155 
Ser Thr Gly Asn Val Arg Lys Leu Val Lys Val Gin Thr Phe Gin Gly 
370 375 . 380 

gat gat gac cac aat tgg aaa att gec cat gtg gtg etc aaa gag gaa 1203 
Asp Asp Asp His Asn Trp Lys lie Ala His Val Val Leu Lys Glu Glu 
385 390 395 

cag aag ttt cgc tac ctt ttc cag ggc aca' aaa ggc gac cct cag aac 1251 
Gin Lys Phe Arg Tyr Leu Phe Gin Gly Thr Lys Gly Asp Pro Gin Asn 
400 405 410 

tea act ggg gga att tac eta gat gac ate act ctg aca gaa acc ccc 1299 
Ser Thr Gly Gly He Tyr Leu Asp Asp He Thr Leu Thr Glu Thr Pro 
415 420 425 430 

tgc ccc aca ggg gtc tgg aca gtc egg aat ttc tec caa gtc ctt gag 1347 
Cys Pro Thr Gly Val Trp Thr Val Arg Asn Phe Ser Gin Val Leu Glu 
435 440 445 

aac acc age aaa ggg gac aag ctt cag age cct cga ttc tac aat teg 1395 
Asn Thr Ser Lys Gly Asp Lys Leu Gin Ser Pro Arg Phe Tyr Asn Ser 
450 455 460 

gag gga tat ggt ttt ggg gta act tta tac cca aat age aga gaa age 1443 
Glu Gly Tyr Gly Phe Gly Val Thr Leu Tyr Pro Asn Ser Arg Glu Ser 
465 470 475 

tct ggt tac ttg aga ctt get ttt cat gtg tgc agt ggg gag aac gat 1491 
Ser Gly Tyr Leu Arg Leu Ala Phe His Val Cys Ser Gly Glu Asn Asp 
480 485 490 

get ate ctg gag tgg ccg gta gaa aac aga cag gtg ata att acc ate 1539 
Ala He Leu Glu Trp Pro Val Glu Asn Arg Gin Val He He Thr He 
495 500 505 510 

ctt gac cag gag cct gat gtc egg aac agg atg tec tea age atg gtg 1587 
Leu Asp Gin Glu Pro Asp Val Arg Asn Arg Met Ser Ser Ser Met Val 
515 520 525 



1635 



ttc act acc teg aag teg cac aca tct cca gcg ata aat gac act gtc 
Phe Thr Thr Ser Lys Ser His Thr Ser Pro Ala He Asn Asp Thr Val 
530 535 540 

ate tgg gac agg ccg tec agg gtg gga acc tat cat aca gac tgt aat 1683 
He Trp Asp Arg Pro Ser Arg Val Gly Thr Tyr His Thr Asp Cys Asn 
545 550 555 

tgt ttt aga age ate gac ttg ggc tgg agt ggt ttc att tec cac caa 1731 
Cys Phe Arg Ser He Asp Leu Gly Trp Ser Gly Phe He Ser His Gin 
560 565 .570 

atg ctg aaa agg agg agt ttc ctg aaa aat gat gac etc ate ata ttt 1779 
Met Leu Lys Arg Arg Ser Phe Leu Lys .Asn Asp Asp Leu He He Phe 
575 580 585 590 

gtg gac ttt gaa gat ate acc cac etc age cag act gaa gtt ccc tct 1827 
Val Asp Phe Glu Asp He Thr His Leu Ser Gin Thr Glu Val Pro Ser 
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595 



600 



605 



aaa ggc aaa aga ctg age ccc caa ggc etc att 
Lys Gly Lys Arg Leu Ser Pro Gin Gly Leu lie 
610 615 



etc caa ggc cag gag 
Leu Gin Gly Gin Glu 
620 



1875 



cag cag gtc tec gaa gaa ggt teg gga aag gec 
Gin Gin Val Ser Glu Glu Gly Ser Gly Lys Ala 
625 630 



atg tta gag gaa gec 
Met Leu Glu Glu Ala 
635 



1923 



eta cct gtc age ctg age cag ggg cag ccc age 
Leu Pro Val Ser Leu Ser Gin Gly Gin Pro Ser 
640 645 



cga cag aag egg teg 
Arg Gin Lys Arg Ser 
650 



1971 



gtg gag aac aca ggc ccc ctg gag gac cat aac 
Val Glu Asn Thr Gly Pro Leu Glu Asp His Asn 
655 660 665 



tgg cca cag tac ttc 
Trp Pro Gin Tyr Phe 
670 



2019 



aga gac cca tgt gac cca aac cct tgc caa aat 
Arg Asp Pro Cys Asp Pro Asn Pro Cys Gin Asn 
675 680 



gac ggc ate tgt gtg 
Asp Gly He Cys Val 
685 



2067 



aac gtg aag ggg atg gcg age tgc agg tgc ate 
Asn Val Lys Gly Met Ala Ser Cys Arg Cys He 
690 695 



tct .gga cat get ttc 
Ser Gly His Ala Phe 
700 



2115 



ttc tac acg ggg gag cgc tgt cag teg gee gag 
Phe Tyr Thr Gly Glu Arg Cys Gin Ser Ala Glu 
705 710 



gtg cac ggc agt gtc 
Val His Gly Ser Val 
.715 



2163 



ctg ggc atg gtg ate gga ggc acg get ggc gtg ate ttc ttg acc ttc 
Leu Gly Met Val He Gly Gly Thr Ala Gly Val- He Phe Leu Thr Phe 



2211 



720 




725 




730 






tec ate ate gec ate ctt tec caa agg cca agg 
Ser He He Ala He Leu Ser Gin Arg Pro Arg 
735 " 740 745 


aag tgacctgcct 
Lys 


2257 


gctggcattg 


gccagaccac 


agcagcacct 


cctccatgca 


ggecttaact 


ttcccatgtt 


2317 


caatgeagtt 


tggggcagct 


tttttatcag 


ecttgetttg 


gataggacct 


ccaaggacta 


2377 


agcctccagc 


cccatgtgtg 


acccttgtca 


tctctctgcc 


ccacataatt 


atgttacttt 


2437 


gctatgtgct 


cctaatgtat 


ctagtgtgtc 


ctgtgacaac 


actcatcaca 


cttcattgta 


2497 


aatcacttgt 


tttattgact 


gtctttccta 


tagactgtaa 


gctccatgag 


ggcaggcaca 


2557 


tgttgttctc 


attgacegtg 


ctggccccag 


tgcctagatg 


catggctggc 


acattgttgg 


2617 


cactcaacaa 


tggttgaatg 


aataaaacaa 


taaatgaatg 


aataactaag 


atatagaaac 


2677 


tctcatttat 


attgeagatt 


gaatatatat 


gatgaaattc 


ttatgttgaa 


tatgttagaa 


2737 


tcaaatactc 


atttttcatt 


agatacagta 


gtgtcatcac 


tcttttaaga 


tcttgttaaa 


2797 


gatttcaaat 


aaaggtactt 


ctggcgagcc 


aggctgeaca 


geatttgett 


tcctctgaga 


2857 


ttctaagaga 


aggectttaa 


taaatttaat 


aaatattgag 


ttagcaaaaa 


aaaaaaaaaa 


2917 


aaa 












2920 



<210> 62 
<211> 746 
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<212> PRT 

<213> NM_005588; meprin A, alpha 



<400> 62 

Met Ala Trp lie Arg Ser Thr Cys lie Leu Phe Phe Thr Leu Leu Phe 
1 5 10 15 



Ala His He Ala Ala Val Pro He Lys His Leu Pro Glu Glu Asn Val 
20 25 30 



His Asp Ala Asp Phe Gly Glu Gin Lys Asp He Ser Glu He Asn Leu 
35 40 45 

Ala Ala Gly Leu Asp Leu Phe Gin Gly Asp He Leu Leu Gin Lys Ser 
50 55 60 

Arg Asn Gly Leu Arg Asp Pro Asn Thr Arg Trp Thr Phe Pro He Pro 
65 70 75 80 

Tyr He Leu Ala Asp Asn Leu Gly Leu Asn Ala Lys Gly Ala He Leu 
85 90 95 



Tyr Ala Phe Glu Met Phe Arg Leu Lys Ser Cys Val Asp Phe Lys Pro 
100 105 no 



Tyr Glu Gly Glu Ser Ser Tyr He He Phe Gin Gin Phe Asp Gly Cys 
115 120 125 



Trp Ser Glu Val Gly Asp Gin His Val Gly Gin Asn He Ser He Gly 
130 135 140 



Gin Gly Cys Ala Tyr Lys Ala He He Glu His Glu He Leu His Ala 
145 150 155 160 



Leu Gly Phe Tyr His Glu Gin Ser Arg Thr Asp Arg Asp Asp Tyr Val 
165 170 175 



Asn He Trp Trp Asp Gin He Leu Ser Gly Tyr Gin .His Asn Phe Asp 
1B0 185 190 



Thr Tyr Asp Asp Ser Leu He Thr Asp Leu Asn Thr Pro Tyr Asp Tyr 
195 200 205 



Glu Ser Leu Met His Tyr Gin Pro Phe Ser Phe Asn Lys Asn Ala Ser 
210 215 220 



Val Pro Thr He Thr Ala Lys He Pro Glu Phe Asn Ser He He Gly 
225 230 235 240 
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Gin Arg Leu Asp Phe Ser Ala He Asp Leu Glu Arg Leu Asn Arg Met 
245 250 255 



Tyr Asn Cys Thr Thr Thr His Thr Leu Leu Asp His Cys Thr Phe Glu 
260 265 270 



Lys Ala Asn He Cys Gly Met He Gin .Gly Thr Arg Asp Asp Thr Asp 
275 280 285 



Trp Ala His Gin Asp Ser Ala Gin Ala Gly Glu Val Asp His Thr Leu 
290 295 300 



Leu Gly Gin Cys Thr Gly Ala Gly Tyr Phe Met Gin Phe Ser Thr Ser 
305 310 315 320 



Ser Gly Ser Ala Glu Glu Ala Ala Leu Leu Glu Ser Arg He Leu Tyr 
325 330 335 



Pro Lys Arg Lys Gin Gin Cys Leu Gin Phe Phe Tyr Lys Met Thr Gly 
340 345 350 



Ser Pro Ser Asp Arg Leu Val Val Trp Val Arg Arg Asp Asp Ser Thr 
355 360 365 



Gly Asn Val Arg Lys Leu Val Lys Val Gin Thr Phe Gin Gly Asp Asp 
370 375 380 



Asp His Asn Trp Lys He Ala His Val Val Leu Lys Glu Glu Gin Lys 
385 390 395 400 



Phe Arg Tyr Leu Phe Gin Gly Thr Lys Gly Asp Pro Gin Asn Ser Thr 
405 410 415 



Gly Gly He Tyr Leu Asp Asp lie Thr Leu Thr Glu .Thr Pro Cys Pro 
420 425 430 



Thr Gly Val Trp Thr Val Arg Asn Phe Ser Gin Val Leu Glu Asn Thr 
435 440 445 



Ser Lys Gly Asp Lys Leu Gin Ser Pro Arg Phe Tyr Asn Ser Glu Gly 
450 455 460 



Tyr Gly Phe Gly Val Thr Leu Tyr Pro Asn Ser Arg Glu Ser Ser Gly 
465 470 475 480 



Tyr Leu Arg Leu Ala Phe His Val Cys Ser Gly Glu Asn Asp Ala He 
485 490 495 



Leu Glu Trp Pro Val Glu Asn Arg Gin Val He' He Thr lie Leu Asp 
500 505 510 
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Gin Glu Pro Asp Val Arg Asn Arg Met Ser Ser Ser Met Val Phe Thr 
515 • 520 525 



Thr Ser Lys Ser His Thr Ser Pro Ala lie Asn Asp Thr Val He Trp 
530 535 540 



Asp Arg Pro Ser Arg Val Gly Thr Tyr His Thr Asp Cys Asn Cys Phe 
545 550 555 560 



Arg Ser He Asp Leu Gly Trp Ser Gly Phe He Ser His Gin Met Leu 
565 570 575 



Lys Arg Arg Ser Phe Leu Lys Asn Asp Asp Leu He He Phe Val Asp 
580 585 590 



Phe Glu Asp lie Thr His Leu Ser Gin THr Glu Val Pro Ser Lys Gly 
595 600 605 



Lys Arg Leu Ser Pro Gin Gly Leu He Leu Gin Gly Gin Glu Gin Gin 
610 615 620 



Val Ser Glu Glu Gly Ser Gly Lys Ala Met Leu Glu Glu Ala Leu Pro 
625 630 635 640 



Val Ser Leu Ser Gin Gly Gin Pro Ser Arg Gin Lys Arg Ser Val Glu 
645 650 " 655 



Asn Thr Gly Pro Leu Glu Asp His Asn Trp Pro Gin Tyr Phe Arg Asp 
660 665 670 



Pro Cys Asp Pro Asn Pro Cys Gin Asn Asp Gly He Cys Val Asn Val 
675 680 " 685 



Lys Gly Met Ala Ser Cys Arg Cys He Ser Gly His Ala Phe Phe Tyr 
690 695 700 



Thr Gly Glu Arg Cys Gin Ser Ala Glu Val His Gly Ser Val Leu Gly 
705 710 715 ' 720 



Met Val He Gly Gly Thr Ala Gly Val He Phe Leu Thr Phe Ser He 
725 730 735 



He Ala He Leu Ser Gin Arg Pro Arg Lys 
740 745 



<210> 63 

<211> 8838 

<212> DNA 

<213> NMJ)15902 EDD, hyperplastic discs protein, DD5 
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<220> 

<22l> CDS 

<222> (34) . . (8430) 

<223> 



<400> 63 

cgccctcgag tggaggacga gaaggaaagc acc atg acg tec ate cat ttc gtg 54 

, Met Thr Ser lie His Phe Val 
15 

gtt cac ccg ctg ccg ggc acc gag gac cag etc aat gac agg tta cga 102 
Val His Pro Leu Pro Gly Thr Glu Asp Gin Leu Asn Asp Arg Leu Arg 
10 15 20 

gaa gtt tct gag aag ctg aac aaa tat aat tta aac age cac ccc cct 150 
Glu Val Ser Glu Lys Leu Asn Lys Tyr Asn Leu Asn Ser His Pro Pro • 
25 30 35 

ttg. aat gta ttg gaa cag get act att aaa cag tgt .gtg gtg gga cca 198 
Leu Asn Val Leu Glu Gin Ala Thr He Lys Gin Cys Val Val Gly Pro 
40 45 50 55 

aat cat get gee ttt ctt ctt gag gat ggt aga gtt tgc agg att ggt 246 
Asn His Ala Ala Phe Leu Leu Glu Asp Gly Arg Val Cys Arg He Gly 
60 65 70 

ttt tea gta cag cca gac aga ttg gaa ttg ggt aaa cct gat aat aat 294 
Phe Ser Val Gin Pro Asp Arg Leu Glu Leu Gly Lys Pro Asp Asn Asn 
75 80 85 

gat ggg tea aag ttg aac age aac teg ggg gca ggg agg acg tea agg 342 
Asp Gly Ser Lys Leu Asn Ser Asn Ser Gly Ala Gly Arg Thr Ser Arg 
90 95 100 

cct ggt agg aca age gac tct cca tgg ttt etc tea ggt tct gag act 390 
Pro Gly Arg Thr Ser Asp Ser Pro Trp Phe Leu Ser Gly Ser Glu Thr 
105 110 115 

eta ggc agg ctg gca ggc aac acc tta gga age cgc tgg agt tct gga 438 
Leu Gly Arg Leu Ala Gly Asn Thr Leu Gly Ser Arg Trp Ser Ser Gly 
120 125 130 135 

gtg ggt gga agt ggt gga gga tec tct .ggt agg tea tea get gga get 486 
Val Gly Gly Ser Gly Gly Gly Ser Ser Gly Arg Ser Ser Ala Gly Ala 
140 145 150 

cga gat tec cgc egg cag act cga gtt att egg aca gga egg gat cga 534 
Arg Asp Ser Arg Arg Gin Thr Arg Val He Arg Thr Gly Arg Asp Arg 
155 160 165 

ggg tct ggg ctt ttg ggc agt cag ccc cag cca gtt att cca gca tct 582 
Gly Ser Gly Leu Leu Gly Ser Gin Pro Gin Pro Val He Pro Ala Ser 
170 175 180 

gtc att cca gag gag ctg att tea cag gee caa gtt gtt tta caa ggc 630 
Val He Pro Glu Glu Leu He Ser Gin Ala Gin Val Val Leu Gin Gly 
185 190 195 

aaa tec aga agt gtc att att cga gaa ctt cag aga aca aat ctt gat 678 
Lys Ser Arg Ser Val He He Arg Glu Leu Gin Arg Thr Asn Leu Asp 
200 205 210 215 
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gtg aac ctt get gta aat aat tta ctt age egg- gat gat gaa gat gga 726 
Val Asn Leu Ala Val Asn Asn Leu Leu Ser Arg Asp Asp Glu Asp Gly 
220 225 * 230 

gat gat ggg gat gat aca gee age gaa tct tat ttg cct gga gag gat 774 
Asp Asp Gly Asp Asp Thr Ala Ser Glu Ser Tyr Leu Pro Gly Glu Asp 
235 240 245 

ctt atg tct etc ctt gat gee gac att cat tct. gec cac cca agt gtc 822 
Leu Met Ser Leu Leu Asp Ala Asp lie His Ser Ala His Pro Ser Val 
250 255 260 

att att gat gca gat gec atg ttt tct gaa gac att age tat ttt ggt 870 
He lie Asp Ala Asp Ala Met Phe Ser Glu Asp lie Ser Tyr Phe Gly 
265 270 275 

tac cct tct ttt cgt cgt tea tea ctt tec agg eta ggc tea tct cga 918 
Tyr Pro Ser Phe Arg Arg Ser Ser Leu Ser Arg Leu Gly Ser Ser Arg 
280 285 290 295 

gtt etc ctt ctt ccc tta gag aga gac tct gag ctg ttg cgt gaa cgt 966 
Val Leu Leu Leu Pro Leu Glu Arg Asp Ser Glu Leu Leu Arg Glu Arg 
300 305 310 

gaa tec gtt tta cgt tta cgt gaa cga agg tgg ctt gat gga gec tea 1014 
Glu Ser Val Leu Arg Leu Arg Glu Arg Arg Trp Leu Asp Gly Ala Ser 
315 320 325 

ttt gat aat gaa agg ggt tct acc age aag gaa gga .gag cca aac ttg 1062 
Phe Asp Asn Glu Arg Gly Ser Thr Ser Lys Glu Gly Glu Pro Asn Leu 
330 335 340 

gat aag aag aat aca cct gtt caa agt cca gta tct eta gga gaa gat 1110 
Asp Lys Lys Asn Thr Pro Val Gin Ser Pro Val Ser Leu Gly Glu Asp 
345 350 355 

ttg cag tgg tgg cct gat aag gat gga aca aaa ttc ate tgt att- ggg 1158 
Leu Gin Trp Trp Pro Asp Lys Asp Gly Thr Lys Phe He Cys He Gly 
360 365 370 375 

get ctg tat tct gaa ctt ctg get gtc age agt aaa gga gaa ctt tat 1206 
Ala Leu Tyr Ser Glu Leu Leu Ala Val Ser Ser Lys Gly Glu Leu Tyr 
380 385 * 390 

cag tgg aaa tgg agt gaa tct gag cct tac aga aat gee cag aat cct 1254 
Gin Trp Lys Trp Ser Glu Ser Glu Pro Tyr Arg Asn Ala Gin Asn Pro 
395 400 405 

tea tta cat cat cca cga gca aca ttt ttg ggg tta acc aat gaa aag 1302 
Ser Leu His His Pro Arg Ala Thr Phe Leu Gly Leu Thr Asn Glu Lys 
410 415 . 420 

ata gtc etc ctg tct gca aat age ata aga gca act gta get aca gaa 1350 
He Val Leu Leu Ser Ala Asn Ser lie Arg Ala Thr Val Ala Thr Glu 
425 430 435 

aat aac aag gtt get aca tgg gtg gat gaa act tta agt tct gtg get 1398 
Asn Asn Lys Val Ala Thr Trp Val Asp Glu Thr Leu Ser Ser Val Ala 
440 445 450 455 

tct aaa tta gag cac act get cag act tac tct gaa ctt caa gga gag 1446 
Ser Lys Leu Glu His Thr Ala Gin Thr Tyr Ser Glu Leu Gin Gly Glu 
460 465 470 

egg ata gtt tct tta cat tgc tgt gec ctt tac acc tgc get cag ctg 14 94 
Arg He Val Ser Leu His Cys Cys Ala Leu Tyr Thr Cys Ala Gin Leu 
475 480 485 
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gaa aac agt tta tat tgg tgg ggt gta gtt cct ttt agt caa agg aag 1542 
Glu Asn Ser Leu Tyr Trp Trp Gly Val Val Pro' Phe Ser Gin Arg Lys 
490 495 500 

aaa atg tta gag aaa get aga gca aaa .aat aaa aag cct aaa tec agt 1590 
Lys Met Leu Glu Lys Ala Arg Ala Lys Asn Lys Lys Pro Lys Ser Ser 
505 510 515 

get ggt att tct tea atg ccg aac ate act gtt ggt acc cag gta tgc 1638 
Ala Gly He Ser Ser Met Pro Asn He Thr Val Gly Thr Gin Val Cys 
520 525 530 535 

ttg aga aat aat cct ctt tat cat get gga gca gtt gca ttt tea att 1686 
Leu Arg Asn Asn Pro Leu Tyr His Ala Gly Ala Val AJ.a Phe Ser He 
540 545 550 

agt get ggg att cct aaa gtt ggt gtc tta atg gag tea gtt tgg aat 1734 
Ser Ala Gly He Pro Lys Val Gly Val Leu Met Glu Ser Val Trp Asn 
555 560 565 

atg aat gac age tgt aga ttt caa ctt aga tct cct gaa age ttg aaa 1782 
Met Asn Asp Ser Cys Arg Phe Gin Leu Arg Ser Pro Glu Ser Leu Lys 
570 575 580 

aac atg gaa aaa get age aaa act act gaa get aag cct gaa agt aag 1830 
Asn Met Glu Lys Ala Ser Lys Thr Thr Glu Ala Lys Pro Glu Ser Lys 
585 590 595 

cag gag cca gtg aaa aca gaa atg ggt cct. cca cca tct cca gca tec 1878 
Gin Glu Pro Val Lys Thr Glu Met Gly Pro Pro Pro Ser Pro Ala Ser 
600 605 610 615 

acg tgt agt gat gca tec tea att gee age agt gca tea atg cca tac 1926 
Thr Cys Ser Asp Ala Ser Ser He Ala Ser' Ser Ala Ser Met Pro Tyr 
620 625 630 

aaa cga cga egg tea acc cct gca cca aaa gaa gag gaa aag gtg aat 1974 
Lys Arg Arg Arg Ser Thr Pro Ala Pro Lys Glu Glu Glu Lys Val Asn 
635 640 645 

gaa gag cag tgg tct ctt egg gaa gtg gtt ttt gtg gaa gat gtc aag 2022 
Glu Glu Gin Trp Ser Leu Arg Glu Val Val Phe Val Glu Asp Val Lys 
650 655 660 

aat gtt cct gtt ggc aag gtg eta aaa gta gat ggt gee tat gtt get 2070 
Asn Val Pro Val Gly Lys Val Leu Lys Val Asp Gly Ala Tyr Val Ala 
665 670 675 

gta aaa ttt cca gga acc tec agt aat .act aac tgt cag aac age tct 2118 
Val Lys Phe Pro Gly Thr Ser Ser Asn Thr Asn Cys Gin Asn Ser Ser 
680 685 690 695 

ggt cca gat get gac cct tct tct etc ctg cag gat tgt agg tta ctt 2166 
Gly Pro Asp Ala Asp Pro Ser Ser Leu Leu Gin Asp Cys Arg Leu Leu 
700 705 710 

aga att gat gaa ttg cag gtt gtc aaa act ggt gga aca ccg aag gtt 2214 
Arg He Asp Glu Leu Gin Val Val Lys Thr Gly Gly Thr Pro Lys Val 
715 720 725 

ccc gac tgt ttc caa agg act cct aaa aag ctt tgt ata cct gaa aaa 2262 
Pro Asp Cys Phe Gin Arg Thr Pro Lys Lys Leu Cys He Pro Glu Lys 
730 735 740 

aca gaa ata tta gca gtg aat gta gat tec aaa ggt gtt cat get gtt 2310 
Thr Glu He Leu Ala Val Asn Val Asp Ser Lys Gly Val His Ala Val 
745 750 755 
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ctg aag act gga aat tgg gtg cga tac tgt ate ttt gat ctt get aca 
Leu Lys Thr Gly Asn Trp Val Arg Tyr Cys lie Phe Asp Leu Ala Thr 
7 60 765 770 



2358 



775 

gga aaa gca gaa cag gaa aat aat ttt cct aca age age att get ttc 2406 
Gly Lys Ala Glu Gin Glu Asn Asn Phe Pro Thr Ser Ser He Ala Phe 
780 785 790 

ctt ggt cag aat gag agg aat gta gee att ttc act get gga cag gaa 2454 
Leu Gly Gin Asn Glu Arg Asn Val Ala He Phe Thr Ala Gly Gin Glu 
795 800 805 

tct ccc att att ctt cga gat gga aat ggt acc ate tac cca atg gec 2502 
Ser Pro He He Leu Arg Asp Gly Asn Gly Thr He Tyr Pro Met Ala 
810 815 820 

aaa gat tgc atg gga gga ata agg gat ccc gat tgg ctg gat ctt cca 2550 
Lys Asp Cys Met Gly Gly He Arg Asp Pro Asp Trp Leu Asp Leu Pro 
825 830 835 

■ cct att agt agt ctt gga atg ggt gtg cat tct tta ata aat ctt cct 2598 
Pro He Ser Ser Leu Gly Met Gly Val His Ser Leu He Asn Leu Pro 
840 845 850 855 

gec aat tea aca ate aaa aag aaa get get gtt ate ate atg get gta 2646 
Ala Asn Ser Thr He Lys Lys Lys Ala Ala Val He He Met Ala Val 
860 865 870 

gag aaa caa acc tta atg caa cac att ctg cgc tgt gac tat gag gee 2694 
Glu Lys Gin Thr Leu Met Gin His He Leu Arg Cys Asp Tyr Glu Ala 
875 880 885 

tgt cga caa tat eta atg aat ctt gag caa gcg gtt gtt tta gag cag 2742 
Cys Arg Gin Tyr Leu Met Asn Leu Glu Gin Ala Val Val Leu Glu Gin 
890 895 900 

aat eta cag atg ctg cag aca ttc ate age cac aga 'tgt gat gga aat 2790 
Asn Leu Gin Met Leu Gin Thr Phe He Ser His Arg Cys Asp Gly Asn 
905 910 915 

cga aat att ttg cat get tgt gta tea gtt tgc ttt cca acc age aat 2838 
Arg Asn He Leu His Ala Cys Val Ser Val Cys Phe Pro Thr Ser Asn 
920 925 930 935 

aaa gaa act aaa gaa gaa gag gaa gcg gag cgt tct gaa aga aat aca 2886 
Lys Glu Thr Lys Glu Glu Glu Glu Ala Glu Arg Ser Glu Arg Asn Thr 
940 945 950 

ttt gca gaa agg ctt tct get gtt gag gec att gca aat gca ata tea 2934 
Phe Ala Glu Arg Leu Ser Ala Val Glu Ala He Ala Asn Ala He Ser 
955 960 965 

gtt gtt tea agt aat ggc cca ggt aat egg get gga tea tea agt age 2982 
Val Val Ser Ser Asn Gly Pro Gly Asn Arg Ala Gly Ser Ser Ser Ser 
970 975 980 

cga agt ttg aga tta egg gaa atg atg aga cgt teg ttg aga gca get 3030 
Arg Ser Leu Arg Leu Arg Glu Met Met Arg Arg Ser Leu Arg Ala Ala 
985 990 995 

ggt ttg ggt aga cat gaa get gga get tea tec agt gac cac cag 3075 

Gly Leu Gly Arg His. Glu Ala Gly Ala Ser Ser Ser Asp His Gin 
1000 1005 1010 

gat cca gtt tea ccc ccc ata get ccc cct agt tgg gtt cct gac 3120 

Asp Pro Val Ser Pro Pro He Ala Pro Pro Ser Trp Val Pro Asp 
1015 1020 1025 



n 

WO 2004/022778 



164 



0 

PCT/AU2003/001166 



cct cct gcg atg gat cct gat ggt gac att gat 
Pro Pro Ala Met Asp Pro Asp Gly Asp He Asp 
1030 1035 1040 



ttt ate ctg gec 
Phe He Leu Ala 



ccc get gtg gga tct ctt acc aca gca gca acc ggt act ggt caa 
Pro Ala Val Gly Ser Leu Thr Thr Ala Ala Thr Gly Thr Gly Gin 
1045 1050 1055 

gga cca age acc tec act att cca ggt cct tec aca gag cca tct 
Gly Pro Ser Thr Ser Thr He Pro Gly Pro Ser Thr Glu Pro Ser 
1060 1065 . 1070 

gta gta gaa tec aag gat cga aag gcg aat get cat ttt ata ttg 
Val Val Glu Ser Lys Asp Arg Lys Ala Asn Ala His Phe He Leu 
107 * 1080 1085 

aaa ttg tta tgt gac agt gtg gtt etc cag ccc tat eta cga gaa 
Lys Leu Leu Cys Asp Ser Val Val Leu Gin Pro Tyr Leu Arg Glu 
10 90 1095. iioo 

ctt ctt tct gee aag gat gca aga ggg atg acc cca ttt atg tea 
Leu Leu Ser Ala Lys Asp Ala Arg Gly Met Thr Pro Phe Met Ser 
1105 mo ni5 



get gta agt ggc cga get 
Ala Val Ser Gly Arg Ala 
1120 H25 



tat cct get gca att acc ate tta gaa 
Tyr Pro Ala Ala He ' Thr He Leu Glu 
1130 



act get cag aaa att gca aaa get gaa ata tec tea agt gaa aaa 
Thr Ala Gin Lys He Ala Lys Ala Glu He Ser Ser Ser Glu Lys 
H35 U40 1145 

gag gaa gat gta ttc atg gga atg gtt tgc cca tea ggt acc aac 
Glu Glu Asp Val Phe Met Gly Met Val Cys Pro Ser Gly Thr Asn 
H50 H55 H60 

cct gat gac tct cct tta tat gtt tta tgt tgt aat gac act tgc 
Pro Asp Asp Ser- Pro Leu Tyr Val Xeu Cys Cys Asn Asp Thr Cys 
1165 H70 ' H75 

agt ttt aca tgg act gga gca gag cac att aac cag gat att ttt 
Ser Phe Thr Trp Thr Gly Ala Glu His He Asn Gin Asp He Phe 
118 <> 1185 H90 

gag tgt cga act tgt ggc ttg ctg gag tea ctg tgt tgt tgt acg 
Glu Cys Arg Thr Cys Gly Leu Leu Glu Ser Leu Cys Cys Cys Thr 
1195 1200 1205 

gaa tgt gca agg gtt tgt cat aaa ggt cat gat tgc aaa etc aaa 
Glu Cys Ala Arg Val Cys His Lys Gly His Asp Cys Lys Leu Lys 
1210 1215 1220 

egg aca tea cca aca gee tac tgt gat tgt tgg gag aaa tgt aaa 
Arg Thr Ser Pro Thr Ala Tyr Cys Asp Cys Trp Glu Lys Cys Lys 
1225 1230 1235 

tgt aaa act ctt att get gga cag aaa tct get cgt ctt gat eta 
Cys Lys Thr Leu He Ala Gly Gin Lys Ser Ala Arg Leu Asp Leu 
12^0 1245 . - 1250 

ctt tat cgc ctg etc act get act aat ctg gtt act ctg cca aac 
Leu Tyr Arg Leu Leu Thr Ala Thr Asn' Leu Val Thr Leu Pro Asn 
1255 1260 1265 



3165 



3210 



3255 



3300 



3345 



3390 



3435 



3480 



3525 



3570 



3615 



3660 



3705 



3750 



3795 



3840 



age agg gga gag cac etc tta eta ttc tta gta cag aca gtc gca 
Ser Arg Gly Glu His Leu Leu Leu Phe Leu Val Gin Thr Val Ala 
1270 1275 1280 



3885 
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agg cag acg gtg gag cat tgt caa tac agg cca cct cga ate agg 
Arg Gin Thr Val Glu His Cys Gin Tyr Arg Pro Pro Arg He Ara 
1285 1290 1295 

gaa gat cgt aac cga aaa aca gec agt cct gaa gat tea gat atg 
Glu Asp Arg Asn Arg Lys Thr Ala Ser Pro Glu * Asp Ser Asp Met 
1300 1305 1310 

cca gat cat gat tta gag cct cca aga ttt gec cag ctt gca ttg 
Pro Asp His Asp Leu Glu Pro Pro Arg Phe Ala Gin Leu Ala Leu 
1315 • 1320 1325 

.gag cgt gtt eta cag gac tgg aat gec ttg aaa- tct atg att atg 
Glu Arg Val Leu Gin Asp Trp Asn Ala Leu Lys Ser Met He Met 
1330 1335 . 1340 

ttt ggg teg cag gag aat aaa gac cct ctt agt gec age agt aga 
Phe Gly Ser Gin Glu Asn Lys Asp Pro Leu Ser Ala Ser Ser Arg 
1345 1350 1355 



ata ggc cat ctt ttg cca gaa gag caa gta tac 
He Gly His Leu Leu. Pro Glu Glu Gin Val Tyr 
13 60 1365 1370 



etc aat cag caa 
Leu Asn Gin Gin 



agt ggc aca att egg ctg gac tgt ttc act cat tgc ctt ata gtt 

??* Gly Thr Ile Asp Cys Phe Thr His Cys Leu He Val 

1375 i3 8 o 1385 

aag tgt aca gca gat att ttg ctt tta gat act eta eta ggt aca 
Lys Cys Thr Ala Asp Ile Leu Leu Leu Asp Thr Leu Leu Gly Thr 
13 90 1395 1400 

eta gtg aaa gaa etc caa aac aaa tat aca cct gga cgt aga gaa 
Leu Val Lys Glu Leu Gin Asn Lys Tyr Thr Pro Gly Arg Arg Glu 
1405 1410 1415 

gaa get att get gtg aca atg agg ttt eta cgt tea gtg gca aga 
Glu Ala He Ala Val Thr Met Arg Phe Leu Arg Ser Val Ala Arg 
xq2D 1425 1430 

2^ 2* ?^ att Ctg agt 9tg gaa atg gct tca tcc aaa aa 9 aaa 
Val Phe Val He Leu Ser Val Glu Met Ala Ser Ser Lys Lys Lys 
143 5 1440 1445 

aac aac ttt att cca cag cca att gga aaa tgc aag cgt gta ttc 
Asn Asn Phe He Pro Gin Pro He Gly Lys Cys Lys Arg Val Phe 
1450 1455 1460 

caa gca ttg eta cct tac gct gtg gaa gaa ttg tgc aac gta gca 
Gin Ala Leu Leu Pro Tyr Ala Val Glu Glu Leu Cys Asn Val Ala 
1465 1470 1475 

gag tea ctg att gtt cct gtc aga atg ggg att gct cgt cca act 
Glu Ser Leu lie Val Pro Val Arg Met Gly Ile Ala Arg Pro Thr 
148 ° 1485 1490 

gca cca ttt acc ctg gct agt act age ata gat gec atg cag ggc 
Ala Pro Phe Thr Leu Ala Ser Thr Ser He Asp Ala Met Gin Gly 
1495 isoo 1505 

agt gaa gaa tta ttt tca gtg gaa cca eta cca cca cga cca tca 
Ser Glu Glu Leu Phe Ser Val Glu Pro Leu Pro Pro Arg Pro Ser 
1510 1515 1520 



tct gat cag tct age age 
Ser Asp Gin Ser Ser Ser 
1525 1530 



tcc agt cag tct cag tca tcc tac ate 
Ser Ser Gin Ser Gin Ser Ser Tyr Ile 
1535 



3930 



3975 



4020 



4065 



4110 



4155 



4200 



4245 



4290 



4335 



4380 



4425 



4470 



4515 



4560 



4605 



4650 
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ate agg aat cca cag cag agg cgc ate age cag tea cag ccc gtt 4695 
lie Arg Asn Pro Gin Gin Arg Arg lie Ser Gin Ser Gin Pro Val 
1540 1545 1550 

egg ggc aga gat gaa gaa cag gat gat att gtt tea gca gat gtg 4740 
Arg Gly Arg Asp Glu Glu Gin Asp Asp He Val Ser Ala Asp Val 
1555 1560 1565 

gaa gag gtt gag gtg gtg gag ggt gtg get gga gaa gag gat cat 4785 
Glu Glu Val Glu Val Val Glu Gly Val Ala Gly Glu Glu Asp His 
1570 1575 1580 

cat gat gaa cag gaa gaa cac ggg gaa gaa aat get gag gca gag 4830 
His Asp Glu Gin Glu Glu His Gly Glu Glu Asn * Ala Glu Ala Glu 
1585 1590 1595 

gga caa cat gat gag cat gat gaa gac ggg agt gat atg gag ctg 4875 
Gly Gin His Asp Glu His Asp Glu Asp Gly Ser Asp Met Glu Leu 
1600 1605 1610 

gac ttg tta gca gca get gaa aca gaa agt gat agt gaa agt aac 4920 
Asp Leu Leu Ala Ala Ala Glu Thr Glu Ser Asp Ser Glu Ser Asn 
1615 1620 1625 

cac age aac caa gat aat get agt ggg cgc aga age gtt gtc act 4965 
His Ser Asn Gin Asp Asn Ala Ser Gly Arg Arg Ser Val Val Thr 
1630 1635 1640 

gca gca act get ggt tea gaa gca gga gca age agt gtt cct gee 5010 
Ala Ala Thr Ala Gly Ser Glu Ala Gly Ala Ser Ser Val Pro Ala 
1645 1650 1655 

ttc ttt tct gaa gat gat tct caa teg aat gac tea agt gat tct 5055 
Phe Phe Ser Glu Asp Asp Ser Gin Ser Asn Asp Ser Ser Asp Ser 
1660 1665 . 1670 

gat age agt agt agt cag agt gac gac ata gaa cag gag ace ttt 5100 
Asp Ser Ser Ser Ser Gin Ser Asp Asp He Glu Gin Glu Thr Phe 
1675 1680 1685 

atg ctt gat gag cca tta gaa aga ace aca aat age tec cat gee 5145 
Met Leu Asp Glu Pro Leu Glu Arg Thr Thr Asn" Ser Ser His Ala 
1690 1695 1700 

aat ggt get gee caa get ccc cgt tea atg cag tgg get gtc cgc 5190 
Asn Gly Ala Ala Gin Ala Pro Arg Ser Met Gin Trp Ala Val Arg 
1705 1710 1715 

aac ace cag cat cag cga gca gee agt aca gee cct tec agt aca 5235 
Asn Thr Gin His Gin Arg Ala Ala Ser Thr Ala Pro Ser Ser Thr 
1720 1725 1730 

tct aca cca gca gca agt tea gcg ggt ttg att tat att gat cct 5280 
Ser Thr Pro Ala Ala Ser Ser Ala Gly Leu He Tyr He Asp Pro 
1735 1740 1745 

tea aac tta cgc egg agt ggt acc ate agt aca agt get gca get 5325 
Ser Asn Leu Arg Arg Ser Gly Thr He Ser Thr Ser Ala Ala Ala 
1750 1755 1760 

gca gca get get ttg gaa get age aac gee age agt tac eta aca 5370 
Ala Ala Ala Ala Leu Glu Ala Ser Asn Ala Ser Ser Tyr Leu Thr 
1765 1770 1775 

tct gca age agt tta gee agg get tac age att gtc att aga caa 5415 
Ser Ala Ser Ser Leu Ala Arg Ala Tyr Ser He Val He Arg Gin 
1780 1785 1790 
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ate teg gac ttg atg ggc ctt att cct aag tat aat cac eta gta 
He Ser Asp Leu Met Gly Leu He Pro Lys Tyr Asn His Leu Val 
1795 isoo isos 

tac tct cag att cca gca get gtg aaa ttg act tac caa gat gca 
Tyr Ser Gin He Pro Ala Ala Val Lys Leu Thr Tyr Gin Asp Ala 
181 0 1815 1820 

gta aac tta cag aac tat gta gaa gaa aag ctt att ccc act tgg 
Val Asn Leu Gin Asn Tyr Val Glu Glu Lys Leu He Pro Thr Trp 
1825 1830 1835 

aac tgg atg gtc agt att atg gat tct act gaa get caa tta cgt 
Asn Trp Met Val Ser He Met Asp Ser Thr Glu Ala Gin Leu Arg 
1840 1845 1850 

tat ggt tct gca tta gca tct get ggt gat cct gga cat cca aat 
Tyr Gly Ser Ala Leu Ala Ser- Ala Gly Asp Pro Gly His Pro Asn 
1855 I860 . 1865 

cat cct ctt cac get tct cag aat tea gcgaga - aga gag agg atg 
His Pro Leu His Ala Ser Gin Asn Ser Ala Arg Arg Glu Arg Met 
1870 1875 1880 

act gcg cga gaa gaa get age tta cga aca ctt gaa ggc aga cga 
Thr Ala Arg Glu Glu Ala Ser Leu Arg Thr Leu Glu Gly Arg Arg 
1885 1890 1895' 



cgt gee ace ttg ctt age 
Arg Ala Thr Leu Leu Ser 
1900 1905 



gee cgt caa gga atg 
Ala Arg Gin Gly Met 
1910 



atg tct gca cga 
Met Ser Ala Arg 



gga 
Gly 
•1915 



gac ttc eta aat tat get ctg tct eta atg egg tct cat aat 
Asp Phe Leu Asn Tyr Ala Leu Ser Leu Met Arg Ser His Asn 
1920 1925 



gat gag cat tct gat gtt ctt cca gtt ttg gat gtt tgc tea ttg 
Asp Glu His Ser Asp Val Leu Pro Val Leu Asp Val Cys Ser Leu 
1930 1935 1940 

aag cat gtg gca tat gtt ttt caa gca ctt ata tac tgg att aag 
Lys His Val Ala Tyr Val Phe Gin Ala Leu He Tyr Trp He Lys 
1945 1950 1955 

gca atg aat cag cag aca aca ttg gat aca cct caa eta gaa cgc 
Ala Met Asn Gin Gin Thr Thr Leu Asp Thr Pro Gin Leu Glu Arg 
^60 1965 . 1970 

aaa agg acg cga gaa etc ttg gaa ctg ggt att gat aat gaa gat 
Lys Arg Thr Arg Glu Leu Leu Glu Leu Gly He Asp Asn Glu Asp 
1975 1980 1985 

tea gaa cat gaa aat gat gat gac ace aat caa agt get act ttg 
Ser Glu His Glu Asn Asp Asp Asp Thr Asn Gin Ser Ala Thr Leu 
1990 1995 2000 

aat gat aag gat gat gac tct ctt cct gca gaa act ggc caa aac 
Asn Asp Lys Asp Asp Asp Ser Leu Pro Ala Glu Thr Gly Gin Asn 
2005 2010 2015 

cat cca ttt ttc cga cgt tea gac tec atg aca ttc ctt ggg tgt 

Pr ° Phe Phe Arq Arg Ser As P Ser Met Phe Leu. Gly Cys 

2020 2025 2030 

ata ccc cca aat cca ttt gaa gtg cct ctg get gaa gee ate ccc 
He Pro Pro Asn Pro Phe Glu Val Pro Leu Ala Glu Ala He Pro 
2035 2040 2045 



5460 



5505 



5550 



5595 



5640 



5685 



5730 



5775 



5820 



5865 



5910 



5955 



6000 



6045 



6090 



6135 



6180 
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ttg get gat cag cca cat ctg ttg cag cca aat get aga aag gag 
Leu Ala Asp Gin Pro His Leu Leu Gin Pro Asn Ala Arg Lys Glu 
2050 2055 2060 



gat ctt ttt ggc cgt cca agt cag ggt ctt tat 
Asp Leu Phe Gly Arg Pro Ser Gin Gly Leu Tvr 
2065 2 070 2075 



tct tea tct gec 
Ser Ser Ser Ala 



agt agt ggg aaa tgt tta atg gag gtt aca gtg gat aga aac tgc 

onL Ser GlY Lys Cys Leu Met Glu Val Thr Va l Asp Arg Asn Cys 
2080 2085 2090 

eta gag gtt ctt cca aca aaa atg tct tat get gee aat ctg aaa 
Leu Glu Val Leu Pro Thr Lys Met Ser. Tyr Ala Ala Asn Leu Lys 
2095 2100 2105 



aat gta atg aac atg caa 
Asn Val Met Asn Met Gin 
2H0 2115 



aac egg caa aaa aaa 
Asn Arg Gin Lys Lys 
2120 



gaa ggg gaa gaa 
Glu Gly Glu Glu 



cag ccc gtg ctg cca gaa gaa act gag agt tea aaa cca ggg cca 
Gin Pro Val Leu. Pro Glu Glu- Thr Glu Ser Ser Lys Pro Gly Pro 
2125 2130 2135 

tit IT w 3t r fct g f t 9Ca Caa tta aaa agt agc tta cta 9oa 
Ser Ala Hxs Asp Leu Ala Ala Gin Leu Lys Ser Ser Leu Leu Ala 

2140 2145 2XS0 



gaa ata gga ctt act gaa agt gaa ggg cca cct 
Glu He Gly Leu Thr Glu Ser Glu Gly Pro Pro 
2155 2160 2165 



etc aca tct ttc 
Leu Thr Ser Phe 



agg cca cag tgt age ttt atg gga atg gtt att tec cat gat atg 
Arg Pro Gin Cys Ser Phe Met Gly Met Val He Ser His Asp Met 
2170 2175 2180 



ctg cta gga cgt tgg cgc ctt tct tta gaa ctg 
Leu Leu Gly Arg Trp Arg Leu Ser Leu Glu Leu 
2185 2190 2195 



ttc ggc agg gta 
Phe Gly Arg Val 



^ I I H? a 9at 9tt gga gCa gaa cct gga tca a tc cta act gaa 
Phe Met Glu Asp Val Gly Ala Glu Pro Gly Ser He Leu Thr Glu 
2200 2205 2210 



ttg 
Leu 



ggt ggt ttt gag gta aaa gaa tca aaa ttc cgc aga gaa atg 
R Gly Phe Glu Val Lys Glu Ser Lys Phe Arg Arg Glu Met 
2220 2225 



gaa aaa ctg aga' aac cag cag tca aga gat ttg tea cta gag gtt 
Glu Lys Leu Arg Asn Gin Gin Ser .Arg Asp Leu Ser Leu Glu Val 
2230 2235 2240 

gat egg gat cga gat ctt etc att cag cag act atg agg cag ctt 
Asp Arg Asp Arg Asp Leu Leu He Gin Gin Thr Met Arg Gin Leu 
2245 2250 2 255 

aac aat cac ttt ggt cga aga tgt get act aca cca atg get gta 
Asn Asn His Phe Gly Arg Arg Cys Ala Thr Thr Pro Met Ala Val 
2260 2265 2270 

cac aga gta aaa gtc aca ttt aag gat gag cca gga gag ggc agt 
Has Arg Val Lys Val Thr Phe Lys Asp Glu Pro Gly Glu Gly Ser 
22/5 2280 2285 



ggt gta gca cga agt ttt tat aca gee att gca 
Gly Val Ala Arg Ser Phe Tyr Thr Ala He Ala 
2290 2295 2300 



caa gca ttt tta 
Gin Ala Phe Leu 



6225 



6270 



6315 



6360 



6405 



6450 



6495 



6540 



6585 



6630 



6675 



6720 



6765 



6810 



6855 



6900 



6945 
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tea aat gaa aaa ttg cca aat eta gag tgt ate caa aat gee aac 
Ser Asn Glu Lys Leu Pro Asn Leu Glu Cys lie Gin Asn Ala Asn 
2305 2310 2315 

aaa ggc ace cac aca agt tta atg cag aga tta agg aac cga gga 
Lys Gly Thr His Thr Ser Leu Met Gin Arg Leu Arg Asn Arg Gly 
2320 2325 2330 

gag aga gac egg gaa agg gag aga gaa agg gaa atg agg agg agt 
Glu Arg Asp Arg Glu Arg Glu Arg Glu Arg Glu Met Arg Arg Ser 
2335 2340 2345 

agt ggt ttg cga gca ggt tct egg agg gac egg gat aga gac ttt 
Ser Gly Leu Arg Ala Gly Ser Arg Arg Asp Arg Asp Arg Asp Phe 
2350 2355 2360 

aga aga cag ctt tec ate gac act agg ccc ttt aga cca gee tct 
Arg Arg Gin Leu Ser He Asp Thr Arg Pro Phe Arg Pro Ala Ser 
2365 2370 2375' 

gaa ggg aat cct age gat gat cct gag cct ttg cca gca cat egg 
Glu Gly Asn Pro Ser Asp Asp. Pro Glu Pro Leu -Pro Ala His Arg 
2380 2385 2390 

cag gca ctt gga gag agg ctt. tat cct cgt gta caa gca atg caa 
Gin Ala Leu Gly Glu Arg Leu Tyr Pro Arg Val Gin Ala Met Gin 
2395 2400 2405 

cca gca ttt gca agt aaa ate act ggc atg ttg ttg gaa tta tec 
Pro Ala Phe Ala Ser Lys He Thr Gly Met Leu Leu Glu Leu Ser 
2410 2415 2420 

cca get cag ctg ctt etc ctt eta gca agt gag gat tct ctg aga 
Pro Ala Gin Leu Leu Leu Leu Leu Ala Ser Glu Asp Ser Leu Arg 
2425 2430 2435 

gca aga gtg gat gag gee atg gaa etc att att gca cat gga egg 
Ala Arg Val Asp Glu Ala Met Glu Leu He He Ala His Gly Arg 
2440 2445 2450 

gaa aat gga get gat agt ate ctg gat ctt gga tta gta gac tec 
Glu Asn Gly Ala Asp Ser He Leu Asp Leu Gly Leu Val Asp Ser 
2455 2460 2465 

tea gaa aag gta cag cag gaa aac cga aag cgc cat ggc tct agt 
Ser Glu Lys Val Gin Gin Glu Asn Arg Lys Arg His Gly Ser Ser 
2470 2475 2480 

cga agt gta gta gat atg gat tta gat gat aca gat gat ggt gat 
Arg Ser Val Val Asp Met Asp Leu Asp Asp Thr Asp Asp Gly Asp 
2485 2490 2495 

gac aat gee cct ttg ttt tac caa cct ggg aaa aga gga ttt tat 
Asp Asn Ala Pro Leu Phe Tyr Gin Pro Gly Lys Arg Gly Phe Tyr 
2500 2505 2510 

act cca agg cct ggc aag aac aca gaa gca agg ttg aat tgt ttc 
Thr Pro Arg Pro Gly Lys Asn Thr Glu Ala Arg Leu Asn Cys Phe 
2515 2520 2525 

aga aac att ggc agg att ctt gga eta tgt ctg tta cag aat gaa 
Arg Asn He Gly Arg He Leu Gly Leu Cys Leu Leu Gin Asn Glu 
2530 2535 2540 

etc tgt cct ate aca ttg aat aga cat gta att aaa gta ttg ctt 
Leu Cys Pro He Thr Leu Asn Arg His Val lie Lys Val Leu Leu 
2545 2550 2555 
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ggt aga aaa gtc aat tgg cat gat ttt get ttt ttt gat cct gta 
Gly Arg Lys Val Asn Trp His Asp Phe Ala Phe Phe Asp Pro Val 
2560 2565 2570 



atg tat gag agt ttg egg caa eta ate etc gcg 
Met Tyr Glu Ser Leu Arg Gin Leu lie Leu Ala 
2575 2580 ; 2585 



tct cag agt tea 
Ser Gin Ser Ser 



gat get gat get gtt ttc tea gca atg' gat ttg gca ttt gca att 
Asp Ala Asp Ala Val Phe Ser Ala Met Asp Leu Ala Phe Ala He 
2590 2595 ' 2600 



gaa etc att cct 
Glu Leu He Pro 



gac ctg tgt aaa gaa gaa ggt gga gga cag gtt 
Asp Leu Cys Lys Glu Glu Gly Gly Gly Gin Val 
2605 2610 * 2615 

aat ggt gta aat ata cca gtc act cca cag aat gta tat gag tat 
Asn Gly Val Asn He Pro Val Thr Pro Gin Asn Val Tyr Glu Tyr 
2620 2625 2630 

gtg egg aaa tac gca gaa cac aga atg ttg gta gtt gca gaa cag 
Val Arg Lys Tyr Ala Glu His Arg Met Leu Val Val Ala Glu Gin 
2635 2640 2645 

ccc tta cat gca atg agg aaa ggt eta eta gat gtg ctt cca aaa 
Pro Leu His Ala Met Arg Lys Gly Leu Leu Asp Val Leu Pro Lys 
2650 2655 2660 

aat tea tta gaa gat tta acg gca gaa gat ttt agg ctt ttg gta 
Asn Ser Leu Glu Asp Leu Thr Ala Glu Asp Phe Arg Leu Leu Val 
2665 2670 ' 2675 

aat ggc tgc ggt gaa gtc aat gtg caa atg ctg ate agt ttt acc 
Asn Gly Cys Gly Glu Val Asn Val Gin Met Leu He Ser Phe Thr 
2680 2685 2690 

tct ttc aat gat gaa tea gga gaa aat get gag aag ctt ctg cag 
Ser Phe Asn Asp Glu Ser Gly Glu Asn Ala Glu Lys Leu Leu Gin 
2695 2700 2705 

ttc aag cgt tgg ttc tgg tea ata gta gag aag atg age atg aca 
Phe Lys Arg Trp Phe Trp Ser He Val Glu Lys Met Ser Met Thr 
2710 2715 2720 

gaa cga caa gat ctt gtt tac ttt tgg aca tea age cca tea ctg 
Glu Arg Gin Asp Leu Val Tyr Phe Trp Thr Ser Ser Pro Ser Leu 
2725 2730 2735 

cca gee agt gaa gaa gga ttc cag cct atg ccc tea ate aca ata 
Pro Ala Ser Glu Glu Gly Phe Gin Pro Met Pro Ser He Thr He 
2740 2745 2750 

aga cca cca gat gac caa cat ctt cct act gca aat act tgc att 
Arg Pro Pro Asp Asp Gin His Leu Pro Thr Ala Asn Thr Cys He 
2755 2760 2765 

tct cga ctt tac gtc cca etc tat tec tct aaa cag att etc aaa 
Ser Arg Leu Tyr Val Pro Leu Tyr Ser Ser Lys Gin He Leu Lys 
2770 2775 2780 

cag aaa ttg tta etc gee att aag acc aag aat ttt ggt ttt gtg 
Gin Lys Leu Leu Leu Ala lie Lys Thr Lys Asn Phe Gly Phe Val 
2785 2790 2795 

tagagtataa aaagtgtgta ttgctgtgta atattactag caaattttgt agattttttt 
ccatttgtct ataaaagtta aagtttatgg aagttaatgc tgtcataccc ccctggtggt 
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8025 
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accttaaaga gataaaatgc agacattcct tgctgagttt atagcttaaa ggcctaagga 8610 

gcactagcaa catttggcta tattggtttg ctagtcacca acttctgggt ctaaccccag 8670 

ccaaagatga cagcagaaca acataattta cactgtgatt tatctttttg ctgaggggaa 8730 

aaaaatgtaa atgttctgaa aattcactgc tgcctttgtg gaaactgttt cagcaaaggt 8790 

tcttgtatag agggaatagg gaatttcaaa ataaaaaatt aagtatgt 8838 



<210> 


64 


<211> 


2799 


<212> 


PRT 


<213> 


NMJ)15902 


<400> 


64 



Met Thr Ser He His Phe Val Val His Pro Leu Pro Gly Thr Glu Asp 
1 5 10 15 



Gin Leu Asn Asp Arg Leu Arg Glu Val Ser Glu Lys Leu Asn Lys Tyr 
20 25 30 



Asn Leu Asn Ser His Pro Pro Leu Asn Val Leu Glu Gin Ala Thr He 
35 40 45 



Lys Gin Cys Val Val Gly Pro Asn His Ala Ala Phe Leu Leu Glu Asp 
50 55 60 



Gly Arg Val Cys Arg He Gly Phe Ser Val Gin Pro Asp Arg Leu Glu 
65 70 75 80 



Leu Gly Lys Pro Asp Asn Asn Asp Gly Ser Lys Leu Asn Ser Asn Ser 
85 90 95 



Gly Ala Gly Arg Thr Ser Arg Pro Gly Arg Thr Ser Asp Ser Pro Trp 
100 105 110 



Phe Leu Ser Gly Ser Glu Thr Leu Gly Arg Leu Ala Gly Asn Thr Leu 
115 120 125 



Gly Ser Arg Trp Ser Ser Gly Val Gly Gly Ser Gly Gly Gly Ser Ser 
130 135 14() 



Gly Arg Ser Ser Ala Gly Ala Arg Asp Ser Arg Arg Gin Thr Arg Val 
145 150 155 160 



He Arg Thr Gly Arg Asp Arg Gly Ser Gly Leu Leu Gly Ser Gin Pro 
165 170 175 
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Gin Pro Val lie Pro Ala Ser Val lie Pro Glu Glu Leu He- Ser Gin 
180 185 190 

Ala Gin Val Val Leu Gin Gly L ys Ser Arg Ser Val lie He Arg Glu 
195 200 205 

Leu Gin Arg Thr Asn Leu Asp Val Asn Leu Ala Val Asn Asn Leu Leu 
210 .215 220 

225 *** ~~ ASP ASP Gly A5P ASp Thr Ala Ser Glu 



230 



235 



240 



Ser Tyr Leu Pro Gly Glu Asp Leu Met Ser Leu Leu Asp Ala Asp He 
245 2 5 0 255 

His ser Ala His Pro Ser Val He lie Asp Ala Asp Ala Met Phe Ser 
260 265 270 



Glu Asp lie Ser Tyr Phe Gly Tyr Pro Ser Phe Arg Arg Ser Ser Leu 
275 280 285 

Ser Arg Leu Gly Ser Ser Arg Val Leu Leu Leu Pro Leu Glu Arg Asp 



295 



300 



Ser Glu Leu Leu Arg Glu Arg Glu Ser Val Leu Arg Leu Arg Glu Arg 



310 



315 



320 



Arg Trp Leu Asp Gly Ala Ser Phe Asp Asn Glu Arg Gly Ser Thr Ser 
325 330 335 



Lys Glu Gly Glu Pro Asn Leu Asp Lys Lys Asn Thr Pro Val Gin Ser 
340 345 350 

355 52! Gln TrP TrP Pr ° AS ? LyS G1 * 



360 



365 



Thr Lys Phe He Cys He Gly Ala Leu Tyr Ser Glu Leu Leu Ala Val 
370 375 380 

Ser Ser Lys Gly Glu Leu Tyr Gin Trp Lys Trp Ser Glu Ser Glu Pro 



390 



395 



400 



Tyr Arg Asn Ala Gin Asn Pro Ser Leu His His Pro Arg Ala Thr Phe 
405 4io 



415 



Leu Gly Leu Thr Asn Glu Lys lie Val Leu Leu Ser Ala Asn Ser He 
420 425 430 

Arg Ala Thr Val Ala Thr Glu Asn Asn Lys Val Ala Thr Trp Val Asp 



440 



445 
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Glu Thr Leu Ser Ser Val Ala Ser Lys Leu Glu His Thr Ala Gin Thr 
450 455 4 6 o 



Tyr Ser Glu Leu Gin Gly Glu Arg lie Val Ser Leu His Cys Cys Ala 
465 470 . 475 1 



480 



Leu Tyr Thr Cys Ala Gin Leu Glu Asn Ser Leu Tyr Trp Trp Gly Val 
485 .490 495 



Val Pro Phe Ser Gin Arg Lys Lys Met Leu Glu Lys Ala Arg Ala Lys 
500 505 



510 



Asn Lys Lys Pro Lys Ser Ser Ala Gly lie. Ser Ser Met Pro Asn lie 
515 520 525 



Thr Val Gly Thr Gin Val Cys Leu Arg Asn Asn Pro Leu Tyr His Ala 
530 535 



540 



Gly Ala Val Ala Phe Ser He Ser Ala Gly lie Pro Lys Val Gly Val 
545 550 555 560 

Leu Met Glu Ser Val Trp Asn Met Asn Asp Ser Cys Arg Phe Gin Leu 
565 570 575 

Arg Ser Pro Glu Ser Leu Lys Asn Met Glu Lys Ala Ser Lys Thr Thr 
580 585 590 

Glu Ala Lys Pro Glu Ser Lys Gin Glu Pro Val Lys Thr Glu Met Glv 
595 600 605 

Pro Pro Pro Ser Pro Ala Ser Thr Cys' Ser Asp Ala Ser Ser He Ala 
610 615 620. 

Ser Ser Ala Ser Met Pro Tyr Lys Arg Arg Arg Ser Thr Pro Ala Pro 
625 630 635 640 

Lys Glu Glu Glu Lys Val Asn Glu Glu Gin Trp Ser Leu Arg Glu Val 
645 650 655 

Val Phe Val Glu Asp Val Lys Asn Val Pro Val Gly Lys Val Leu Lys 
660 665 670 

Val Asp Gly Ala Tyr Val Ala Val Lys Phe Pro Gly Thr Ser Ser Asn 
675 680 685 

Thr Asn Cys Gin Asn Ser Ser Gly Pro Asp Ala Asp Pro Ser Ser Leu 
690 695 700 



Leu Gin Asp Cys Arg Leu Leu Arg He Asp Glu Leu Gin Val Val Lys 



710 



715 



720 
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Thr Gly Gly Thr Pro Lys Val Pro Asp Cys Phe Gin Arg Thr Pro Lys 
725 730 7 35 

Lys Leu Cys He Pro GIu Lys Thr Glu lie Leu Ala Val Asn Val Asd 
740 745 75Q 

Ser Lys Gly Val His Ala Val Leu Lys Thr. Gly Asn Trp Val Arg Tyr 
755 7 6 o 765 y 

Cys lie Phe Asp Leu Ala Thr Gly Lys Ala Glu Gin Glu Asn Asn Phe 
/0 775 780 

Pro Thr ser Ser He Ala Phe Leu Gly Gin Asn Glu Arg Asn Val Ala 
85 790 795 soo 

He Phe Thr Ala Gly Gin Glu Ser Pro lie He Leu Arg Asp' Gly Asn 
805 810 815 

Gly Thr He Tyr Pro Met Ala Lys Asp Cys Met Gly Gly He Arg Asp 
820 825 830 

Pro Asp Trp Leu Asp Leu Pro Pro He Ser Ser Leu Gly Met Gly Val 

His Ser Leu He Asn Leu Pro Ala Asn Ser Thr He Lys Lys Lys Ala 

855 egg 

Ala Val He He Met Ala Val Glu Lys Gin Thr Leu Met Gin His lie 
870 875 . 880 

Leu Arg Cys Asp Tyr Glu Ala Cys Arg Gin . Tyr Leu Met Asn Leu Glu 
885 ego 895 

Gin Ala Val Val Leu Glu Gin Asn Leu Gin Met Leu Gin Thr Phe He 
. 900 905 g 10 

Ser His Arg Cys Asp Gly Asn Arg Asn He Leu His Ala Cys Val Ser 
915 920 925 

Val Cys Phe Pro Thr Ser Asn Lys Glu Thr Lys .Glu Glu Glu Glu Ala 
930 935 940 

Glu Arg Ser Glu Arg Asn Thr Phe Ala Glu Arg Leu Ser Ala Val Glu 

Ala He Ala Asn Ala He Ser Val Val Ser Ser Asn Gly Pro Gly Asn 
965 970 975 

Arg Ala Gly Ser Ser Ser Ser Arg Ser Leu Arg Leu Arg Glu Met Met 
980 985 990 
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Arg Arg Ser Leu Arg Ala Ala Gly Leu Sly. Arg His Glu Ala Gly Ala 
995 1000 1005 

S " inTn ASP His Gln Asp Pro Val Ser Pro Pr ° He Ala Pro 
1010 1015 • 1020 

ttU ASP Pr ° Pr ° Ala M6t As P Pro ASP Gly Asp 

1025 1030 1035 



116 I04n ^ 116 L6U Ala Pr ° Ala Val Ser Le » Thr Thr Ala 

1045 io50 

Ala Thr Gly Thr Gly Gin Gly Pro Ser Thr Ser Thr He Pro Gly 
1055 1060 . 1065 Y 

Pro Ser Thr Glu Pro Ser Val Val Glu Ser Lys Asp Arg Lys Ala 
u 1075 io80 

Asn Ala His Phe He Leu Lys Leu Leu Cys Asp Ser Val Val Leu 
1085 1090 io95 

Gin Pr^ Tyr Leu Arg Glu Leu_ Leu Ser Ala Lys Asp Ala Arg Gly 

Met Thr Pro Phe Met Ser Ala Val Ser Gly Arg Ala Tyr Pro Ala 

U20 . 1125 

Ala lie Thr lie Leu Glu Thr Ala Gin Lys lie Ala Lys Ala Glu 
liJU 1135 ii 4 o 

He Ser Ser Ser Glu Lys Glu Glu Asp Val Phe Met Gly Met Val 

1150 H55 

Cys Pro^ Ser Gly Thr Asn Pro_ Asp Asp Ser Pro Leu^ Tyr Val Leu 



Cys Cys Asn Asp Thr Cys Ser Phe Thr Trp Thr Gly Ala Glu His 

1180 H85 

116 G1 " Asp Ile Phe Glu Cys Arg Thr Cys Gly Leu Leu Glu 

H95 1200 

£ns G1U CyS Ala Val <*S His Lys Gly 

1210 X215 

HiS A 9 P o ^ LeU LyS Arg Thr Ser Pro Thr Ala ^ Cys Asp 

1225 1230 

Cys Trp Glu Lys Cys Lys Cys Lys Thr Leu lie Ala Gly Gin Lys 
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1235 1240 1245 

Ser Ala Arg Leu Asp Leu Leu Tyr Arg Leu Leu Thr Ala Thr Asn 
1250 1255 1260 

Leu Val Thr Leu Pro Asn Ser Arg Gly Glu His Leu Leu Leu Phe 
1265 1270 1275 

Leu Val Gin Thr Val Ala Arg Gin Thr Val Glu His Cys Gin Tyr 
1280 1285 1290 

Arg Pro Pro Arg He Arg Glu Asp Arg Asn Arg Lys Thr Ala Ser 
1295 . 1300 1305 

Pro Glu Asp Ser Asp Met Pro Asp His Asp Leu Glu Pro Pro Arg 
1310 1315 1320 

Phe Ala Gin Leu Ala Leu Glu Arg Val Leu Gin Asp Trp Asn Ala 
1325 1330 1335 

Leu Lys Ser Met He Met Phe Gly Ser Gin Glu Asn Lys Asp Pro 
1340 1345 1350 



Leu Ser Ala Ser Ser Arg He Gly His Leu Leu Pro Glu Glu Gin 
1355 1360 1365 



Val Tyr Leu Asn Gin Gin Ser Gly Thr He Arg Leu Asp Cys Phe 
1370 1375 1380 



Thr His Cys Leu He Val Lys Cys Thr Ala Asp He Leu Leu Leu 
1385 1390 1395 



Asp Thr Leu Leu Gly Thr Leu Val Lys Glu Leu Gin Asn Lys Tyr 
1400 1405 1410 



Thr Pro Gly Arg Arg Glu Glu Ala He Ala Val Thr Met Arg Phe 
1415 1420 1425 



Leu Arg Ser Val Ala Arg Val Phe Val lie Leu Ser Val Glu Met 
1430 1435 1440 



Ala Ser Ser Lys Lys Lys Asn Asn Phe He Pro Gin Pro He Gly 
1445 1450 1455 



Lys Cys Lys Arg Val Phe Gin Ala Leu Leu Pro Tyr Ala Val Glu 
1460 1465 1470 



Glu Leu Cys Asn Val Ala Glu Ser Leu He Val Pro Val Arg Met 
1475 1480 1485 
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Gly He Ala Arg Pro Thr Ala Pro Phe Thr Leu Ala Ser Thr Ser 
1490 1495 1500 

He Asp Ala Met Gin Gly Ser Glu Glu Leu Phe Ser Val Glu Pro 
1505 1510 153.5 

Leu Pro Pro Arg Pro Ser Ser Asp Gin Ser Ser Ser Ser Ser Gin 
1520 1525 1530 

Ser Gin Ser Ser Tyr He He Arg Asn Pro Gin Gin Arg Arq He 
1535 1540 1545 

Ser Gin Ser Gin Pro Val Arg Gly Arg Asp Glu Glu Gin Asp Asp 
1550 1555 1560 

He Val Ser Ala Asp Val Glu Glu Val Glu Val Val Glu Gly Val 
1565 1570 1575 

Ala Gly Glu Glu Asp His His Asp Glu Gin Glu Glu His Gly Glu 
1580 1585 1590 

Glu Asn Ala Glu Ala Glu Gly Gin His Asp Glu His Asp Glu Asp 
1595 i 6 oo 1605 

Gly Ser Asp Met Glu Leu Asp Leu Leu Ala Ala Ala Glu Thr Glu 
1 6 1° 1615 1620 

Ser Asp Ser Glu Ser Asn His Ser Asn Gin Asp Asn Ala Ser Glv 
1625 1630 i 6 35 

Arg Arg Ser Val Val Thr Ala Ala Thr Ala Gly Ser Glu Ala Gly 
1640 1645 i 6 5o 

Ala Ser Ser Val Pro Ala Phe Phe Ser Glu Asp Asp Ser Gin Ser 
X655 - 1660 1665 

Asn Asp Ser Ser Asp Ser Asp Ser Ser Ser Ser Gin Ser Asp Asp 
1670 1675 1680 

He Glu Gin Glu Thr Phe Met Leu Asp Glu Pro Leu Glu Arg Thr 
1685 1690 1695 

Thr Asn Ser Ser His Ala Asn Gly Ala Ala Gin Ala Pro Arg Ser 
1700 1705 1710 

Met Gin Trp Ala Val Arg Asn Thr Gin His Gin Arg Ala Ala Ser 
1715 1720 1725 



Thr Ala Pro Ser Ser Thr Ser Thr Pro Ala Ala Ser Ser Ala Gly 
1730 1735 1740 
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Leu lie Tyr lie Asp Pro Ser Asn Leu Arg Arg Ser Gly Thr lie 
1745 1750' 1755- 

Ser Thr Ser Ala Ala Ala Ala Ala Ala Ala Leu Glu Ala Ser Asn 
1760 1765 ' 1770 

Ala Ser Ser Tyr Leu Thr Ser Ala Ser Ser Leu Ala Arg Ala Tyr 
1775 1780 1785 

Ser lie Val He Arg Gin He Ser Asp Leu Met Gly Leu He Pro 
1790 1795 1800 

. Lys Tyr Asn His Leu Val Tyr Ser Gin He Pro Ala Ala Val Lys 
1805 1810 i 8 i5 

Leu Thr Tyr Gin Asp Ala Val Asn Leu Gin Asn Tyr Val Glu Glu 
1820 1825 . 1830 

LyS Ile Pr ° Thr Trp Asn Tr P Met Val Ser He Met Asp Ser 

1835 1840 1845 

Th? ?1« Ala Gln LeU Arg G± y Ser Ala Leu Ala Ser Ala Gly 

1850 1855 i 86 o 

Asp Pro Gly His Pro Asn His Pro Leu His Ala Ser Gin Asn Ser 
18 <S5 1870 1875 

• Ala Arg Glu Arg Met Thr Ala Arg Glu Glu Ala Ser Leu Arg 

1880 1885 1890 

Thr Leu Glu Gly Arg Arg Arg Ala Thr Leu Leu Ser Ala Arg Gin 
1895 1900 1905 

Gly Met Met Ser Ala Arg Gly Asp Phe Leu Asn Tyr Ala Leu Ser 
1910 1915 1920 

Leu Met Arg Ser His Asn Asp Glu His Ser Asp Val Leu Pro Val 
* 9 25 1930 1935 

Leu Asp Val Cys Ser Leu Lys His Val Ala Tyr Val Phe Gin Ala 
1940 1945 1950 

Leu lie Tyr Trp He Lys Ala Met Asn Gin Gin Thr Thr Leu Asp 
1955 I960 1965 

Thr Pro Gin Leu Glu Arg Lys Arg Thr Arg Glu Leu Leu Glu Leu 
1970 1975 198O 



Gly Ile Asp Asn Glu Asp Ser Glu His Glu Asn Asp Asp Asp Thr 
1985 1990 1995 
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ASn Ser Ala Thr Leu Asn As P Lys As P As P Asp Ser Leu Pro 

2000 2005 2010 

Ala Glu Thr Gly Gin Asn His Pro Phe Phe Arg Arg Ser Asp Ser 
2015 2020 2025 

Met Thr Phe Leu Gly Cys lie Pro Pro Asn Pro Phe Glu Val Pro 
2030 2035 2040 

Leu Ala Glu Ala He Pro Leu Ala Asp Gin Pro His Leu Leu Gin 
2045 2050 2055 

Pro Asn Ala Arg Lys Glu Asp Leu Phe Gly Arg Pro Ser Gin Gly 
2060 2065 2070 

L6U oXL Ser Ser Ser Ala Ser Ser Glv Cya Met Glu Val 

2075 2080 2085 

Thr Val Asp Arg Asn Cys Leu Glu Val Leu Pro Thr Lys Met Ser 
2090 2095 2100 

Tyr Ala Ala Asn Leu Lys Asn Val Met Asn Met Gin Asn Arg Gin 
2105 2110 • 2115 

Lys Lys Glu Gly Glu Glu Gin Pro Val Leu Pro Glu Glu Thr Glu 
2120 2125 2130 

SSr !!L LyS Pr ° Gly Pro Ser Ma His As P Leu Ala Ala Gin Leu 
2135 2140 2145 

Lys Ser Ser Leu Leu Ala Glu He Gly Leu Thr Glu Ser Glu Gly 
2150 2155. 2160 

Pro Pro Leu Thr Ser. Phe Arg Pro Gin Cys Ser Phe Met Gly Met 
2165 2170 2175 

Val ~L Ser H±S ASP Met heu Leu G1 y A* 1 ? Trp Arg Leu Ser Leu 
2180 2185 2190 

Glu Leu Phe Gly Arg Val Phe Met Glu Asp Val Gly Ala Glu Pro 
2195 2200 2205 

Gly Ser He Leu Thr Glu Leu Gly Gly Phe Glu Val Lys Glu Ser 
2210 2215 2220 

Lys. Phe Arg Arg Glu Met Glu Lys Leu Arg Asn Gin Gin Ser Arg 
2225 2230 2235 

Asp Leu Ser Leu Glu Val Asp Arg Asp Arg Asp Leu Leu He Gin 
2240 2245 2250 
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Gin Thr Met Arg Gin Leu Asn Asn His Phe Gly Arg Arg Cys Ala 
2255 2260 2265 

Thr Thr Pro Met Ala Val His Arg Val Lys Val Thr Phe Lys Asp 
2270 2275 2280 

Glu Pro Gly Glu Gly Ser Gly Val Ala Arg Ser Phe Tyr Thr Ala 
2285 2290 2295 

lie Ala Gin Ala Phe Leu Ser Asn Glu Lys Leu Pro Asn Leu Glu 
2300 2305 2310 

Cys He Gin Asn Ala Asn Lys Gly Thr His Thr Ser Leu Met Gin 
2315 2320 2325 

Arg Leu Arg Asn Arg Gly Glu Arg Asp Arg Glu Arg Glu Arg Glu 
2330 2335 2340 

Arg Glu Met Arg Arg Ser Ser Gly Leu Arg Ala Gly Ser Arg Arg 
2345 2350 2355 

Asp Arg Asp Arg Asp Phe Arg Arg Gin Leu Ser He Asp Thr Arg 
2360 2365 2370 

Pro Phe Arg Pro Ala Ser Glu Gly Asn Pro Ser Asp Asp Pro Glu 
2375 2380 2385 

Pro Leu Pro Ala His Arg Gin Ala Leu Gly Glu Arg Leu Tyr Pro 
2390 2395 2400 

Arg Val Gin Ala Met Gin Pro Ala Phe Ala Ser Lys He Thr Gly 
2405 2410 2415 

Met Leu Leu Glu Leu Ser Pro Ala Gin Leu Leu Leu Leu Leu Ala 
2420 2425. 2430 

Ser Glu Asp Ser Leu Arg Ala Arg Val Asp Glu Ala Met Glu Leu 
2435 2440 2445 

He He Ala His Gly Arg Glu Asn Gly Ala Asp Ser He Leu Asp 
2450 2455 2460 

Leu Gly Leu Val Asp Ser Ser Glu Lys Val Gin Gin Glu Asn Arg 
2465 2470 2475 

Lys Arg His Gly Ser Ser Arg Ser Val Val Asp Met Asp Leu Asp 
2480 2485 2490 

Asp Thr Asp Asp Gly Asp Asp Asn Ala Pro Leu Phe Tyr Gin Pro 
2495 2500 2505 
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Gly Lys Arg Gly Phe Tyr Thr Pro Arg Pro Gly Lys Asn Thr Giu 
2510 2515 2520 

Ala Arg Leu Asn Cys Phe Arg Asn He Gly Arg He Leu Gly Leu 
2525 2530 2535 

Cys Leu Leu Gin Asn Glu Leu Cys Pro He Thr Leu Asn Arg His 
2540 2545 2550 

Val He Lys Val Leu Leu Gly Arg Lys Val Asn Trp His Asp Phe 
2555 2560 2565 

Ala Phe Phe Asp Pro Val Met Tyr Glu Ser Leu Arg Gin Leu He 
2570 2575 2580 

Leu Ala Ser Gin Ser Ser Asp Ala Asp Ala Val Phe Ser Ala Met 
2585 2590 2595 

Asp Leu Ala Phe Ala He Asp Leu Cys Lys Glu Glu Gly Glv Gly 
2600 2605 2610 

Gin Val Glu Leu He Pro Asn Gly Val Asn He Pro Val Thr Pro 
2615 2620 2625 

Gin Asn Val Tyr Glu Tyr Val Arg Lys Tyr Ala Glu His Arg Met 
2630 2635 2640 



Leu Val Val Ala Glu Gin Pro Leu His Ala Met Arg Lys Gly Leu 
2645 2650 2655 

Leu Asp Val Leu Pro Lys Asn Ser Leu Glu Asp Leu Thr Ala Glu 
266 0 2665 2670 

Asp Phe Arg Leu Leu Val Asn Gly Cys Gly Glu Val Asn Val Gin 
2675 2680 2685 

Met Leu He Ser- Phe Thr Ser ■ Phe Asn Asp Glu Ser Gly Glu Asn 
2690 2695 2700 

Ala Glu Lys Leu Leu Gin Phe Lys Arg Trp Phe Trp Ser He Val 
2705 2710 2715 

Glu Lys Met Ser Met Thr Glu Arg Gin Asp Leu Val Tyr Phe Trp 
2720 2725 2730 

Thr Ser Ser Pro Ser Leu Pro Ala Ser Glu Glu Gly Phe Gin Pro 
2735 2740 2745 



Met Pro Ser He Thr He Arg Pro Pro Asp Asp Gin His Leu Pro 
2750 2755 2760 
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Thr Ala Asn Thr Cys lie Ser Arg Leu Tyr Val Pro Leu Tyr Ser 
2765 2770 2775 



Ser Lys Gin lie Leu Lys Gin Lys Leu Leu Leu Ala lie Lys Thr 
2780 2785 " 2790 



Lys Asn Phe Gly Phe Val 
2795 



<210> 65 

<211> 4256 

<212> DNA 

<213> NM 014992 DAAM1 



<220> 

<221> CDS 

<222> (126) .. (3359) 

<223> 



<400> 65 

tacgagcggc tgtgaaggaa actgtttaac cggatcccat tgtaeccaga gtgcagagcc 60 

gcctttccag catgcagggg ctgctcagcg tttagtcac'a tcaagaaata gaacagaatt 120 

cagcc atg gcc cca aga aag aga ggt gga cga ggt att tea ttc ate ttt 170 
Met Ala Pro Arg Lys Arg Gly Gly Arg Gly lie Ser Phe lie Phe 
15 10 15 

tgc tgt ttc cga aat aat gat cac cca gaa ate acg tat egg ctg cga 218 
Cys Cys Phe Arg Asn Asn Asp His Pro Glu He Thr Tyr Arg Leu Arg 
20 25 30 

aat gat age aac ttt gcg ctt cag acc atg gaa cca gca ttg ccc atg 266 
Asn Asp Ser Asn Phe Ala Leu Gin Thr Met Glu Pro Ala Leu Pro Met 
35 40 45 

ccc cct gtg gag gag ctg gat gtc atg ttc agt gaa ctg gtg gat gaa 314 
Pro Pro Val Glu Glu Leu Asp Val Met Phe Ser Glu Leu Val Asp Glu 
50 55 60 

ctg gac etc aca gac aaa cac aga gaa gcc atg ttt gca ctt cca get 362 
Leu Asp Leu Thr Asp Lys His Arg Glu Ala Met Phe Ala Leu Pro Ala 
65 70 75 

gag aaa aaa tgg caa ata tac tgt age aag aaa aag gac cag gaa gaa 410 
Glu Lys Lys Trp Gin He Tyr Cys Ser Lys Lys Lys Asp Gin Glu Glu 
80 85 90 95 

aac aag gga get aca agt tgg cct gaa ttc tac att gat cag etc aat 458 
Asn Lys Gly Ala Thr Ser Trp Pro Glu Phe Tyr He Asp Gin Leu Asn 
100 105 110 

tec atg get get aga aaa tct ctg ctg get tta gag aag gaa gaa gaa 506 
Ser Met Ala Ala Arg Lys Ser Leu Leu Ala Leu Glu Lys Glu Glu Glu 
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115 120 125 

gaa gaa aga agt aaa act ata gag agt tta aag aca gca ctg agg aca 554 
Glu Glu Arg Ser Lys Thr He Glu Ser Leu Lys Thr Ala Leu Arg Thr 
130 135 140 

aaa cca atg agg ttt gta acc aga ttc ate gac ttg gat ggc eta tea 602 
Lys Pro Met Arg Phe Val Thr Arg Phe He Asp Leu Asp Gly Leu Ser 
145 150 155 

tgt ate etc aac ttt eta aag acc atg gac tac gag acc tea gag tct 650 
Cys He Leu Asn Phe Leu Lys Thr Met Asp Tyr Glu Thr Ser Glu Ser 
160 165 170 175 

cga ata cat act tct etc att ggc tgt ata aag gcg tta atg aac aac 698 
Arg He His Thr Ser Leu He Gly Cys He Lys Ala Leu Met Asn Asn 
180 185 190 

tct caa ggc egg get cac gtc ctg get cat tct gag agt att aat gta 746 
Ser Gin Gly Arg Ala His Val Leu Ala His Ser Glu Ser He Asn Val 
195 200 205 

att get cag agt ctg age aca gag aac att aaa acg aag gtg gec gtg 794 
He Ala Gin Ser Leu Ser Thr Glu Asn He Lys. Thr Lys Val Ala Val 
210 215 220 

ctg gaa ate ttg ggc gec gtg tgc ctg gtt ccc ggg ggc cac aag aag 842 
Leu Glu He Leu Gly Ala Val Cys Leu Val Pro Gly Gly His Lys Lys 
225 230 235 

gtt ctg cag gee atg ctg cac tac cag aag tat gec age gaa agg acc 890 
Val Leu Gin Ala Met Leu His Tyr Gin Lys Tyr Ala Ser Glu Arg Thr 
240 245 t 250 255 

cgc ttt cag aca tta. att aac gac ttg gat aaa age act ggg egg tat 938 
Arg Phe Gin Thr Leu He Asn Asp Leu Asp Lys Ser Thr Gly Arg Tyr 
260 265 270 

cga gat gaa gtg agt etc aag act gec ate atg tec ttc att aat gca 986 
Arg Asp Glu Val Ser Leu Lys Thr Ala He Met Ser Phe He Asn Ala 
275 280 285 

gtg etc age caa ggt gca gga gtg gag agt ttg gac ttt aga ctt cat 1034 
Val Leu Ser Gin Gly Ala Gly Val Glu Ser Leu Asp Phe Arg Leu His 
290 295 300 

ctt cgc tat gaa ttt ctg atg tta gga att caa cct gta ata gat aaa 1082 
Leu Arg Tyr Glu Phe Leu Met Leu Gly He Gin Pro Val He Asp Lys 
305 310 315 

tta agg gaa cac gaa aat tea aca tta gat agg cat tta gac ttt ttt 1130 
Leu Arg Glu His Glu Asn Ser Thr Leu Asp Arg His Leu Asp Phe Phe 
320 325 330 335 

gaa atg etc cga aat gaa gat gaa eta gaa ttt gee aaa aga ttt gaa 1178 
Glu Met Leu Arg Asn Glu Asp Glu Leu Glu Phe Ala Lys Arg Phe Glu 
340 345 350 

ctg gtt cac ata gac aca aaa agt gca act cag atg ttt gag ctg acc 1226 
Leu Val His He Asp Thr Lys Ser Ala Thr Gin Met Phe Glu Leu Thr 
355 360 365 

agg aag agg ctg aca cat agt gaa get tac ccg cat ttc atg tec ate 1274 
Arg Lys Arg Leu Thr His Ser Glu Ala Tyr Pro His Phe Met Ser He 
370 375 380 

ctg cac cac tgc etc caa atg cct tac aag agg agt ggc aac act gtt 1322 
Leu His His Cys Leu Gin Met Pro Tyr Lys Arg Ser Gly Asn Thr Val 
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385 390 395 

, cag tac tgg eta eta eta gat aga att at a cag cag ata gtt ate cag 1370 
Gin Tyr Trp Leu Leu Leu Asp Arg lie lie Gin Gin lie Val He Gin 
400 405 410. 415 

aat gac aaa gga cag gac cct gac tec aca cct ttg gaa aac ttt aat 1418 
Asn Asp Lys Gly Gin Asp Pro Asp Ser Thr Pro Leu Glu Asn Phe Asn 
420 425 430 

att aag aat gtc gta cga atg ttg gtt aat gaa aat gaa gtt aag cag 1466 
He Lys Asn Val Val Arg Met Leu Val Asn Glu Asn Glu Val Lys Gin 
435 440 445 

tgg aaa gaa caa gcg gaa aaa atg aga aaa gag cac aat gag. eta caa 1514 
Trp Lys Glu Gin Ala Glu Lys Met Arg Lys Glu His Asn Glu Leu Gin 
450 455 .460 

cag aaa ctg gaa aag aaa gaa cga gaa 'tgt gat get aag act caa gag 1562 
Gin Lys Leu Glu Lys Lys Glu Arg Glu Cys Asp Ala Lys Thr Gin Glu 
465 470 . 475 

aag gaa gag atg atg cag ace tta aat aaa atg aaa gag aaa ctt gaa 1610 
Lys Glu Glu Met Met Gin Thr Leu Asn Lys Met Lys Glu Lys Leu Glu 
480 485 490 495 

aag gag act act gag cat aag caa gtc aag cag cag gtg gcg gac etc 1658 
Lys Glu Thr Thr Glu His Lys Gin Val Lys Gin Glh Val Ala Asp Leu 
500 505 510 

aca gca cag etc cat gag etc age agg agg gec gtc tgt get tea ate 1706 
Thr Ala Gin Leu His Glu Leu Ser Arg Arg Ala Val Cys Ala Ser He 
515' 520 525 

cca ggt gga ccc teg cct gga gca cca gga ggg ccc ttt cct tec tct 1754 
Pro Gly Gly Pro Ser Pro Gly Ala Pro Gly Gly Pro Phe Pro Ser Ser 
530 535 540 

gtg cct gga tct etc ctt cct ccc cca cca ccc cca cct eta cca ggt 1802 
Val Pro Gly Ser Leu Leu Pro Pro Pro Pro Pro Pro Pro Leu Pro Gly 
545 550 555 

ggg atg ctt ccc cct cca ccg cct ccc etc cct cca ggt ggc cct cct 1850 
Gly Met Leu Pro Pro Pro Pro Pro Pro Leu Pro Pro Gly Gly Pro Pro 
560 565 570 ** 575 

cct ccc cca ggg cct cct ccc tta ggg gca ate atg cca cct cct ggt 1898 
Pro Pro Pro Gly Pro Pro Pro Leu Gly Ala He Met Pro Pro Pro Gly 
580 • 585 590 

get cca atg ggc eta gca ctg aag aag aaa age att cct cag ccc aca 1946 
Ala Pro Met Gly Leu Ala Leu Lys Lys Lys Ser He Pro Gin Pro Thr 
595 600 605 

aat gee ctg aaa tec ttc aac tgg tct aaa ctg ccc gag aac aaa ctg 1994 
Asn Ala Leu Lys Ser Phe Asn Trp Ser Lys Leu Pro Glu Asn Lys Leu 
610 615 620 

gaa gga aca gta tgg ace gaa att gat gat aca aaa gtc ttc aaa att 2042 
Glu Gly Thr Val Trp Thr Glu He Asp Asp Thr Lys Val Phe Lys He 
625 630 635 

eta gat ctt gaa gac ctg gaa aga ace ttc tct gec tat caa aga cag 2090 
Leu Asp Leu Glu Asp Leu Glu Arg Thr Phe Ser Ala Tyr Gin Arg Gin 
640 645 . 650 655 

cag gat ttc ttt gtg aac agt aac tec aag cag aaa gaa gca gat gec 2138 
Gin Asp Phe Phe Val Asn Ser Asn Ser Lys Gin Lys Glu Ala Asp Ala 
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660 '665 670 

att gat gac act ctg agt. tec aaa ctt aaa gtt aaa gag ctt teg gtg 2186 
He Asp Asp Thr Leu Ser Ser Lys Leu Lys Val Lys Glu Leu Ser Val 
675 680 685 

att gat ggt egg aga get cag aat tgc aac ate ctt eta teg agg ttg 2234 
He Asp Gly Arg Arg Ala Gin Asn Cys Asn lie Leu Leu Ser Arg Leu 
690 695 700 

aaa tta tec aat gac gaa ate aaa egg gca att eta aca atg gac gaa 2282 
Lys Leu Ser Asn Asp Glu He Lys Arg Ala He Leu Thr Met Asp Glu 
705 710 715 

cag gaa gat ctg ccc aag gac atg ttg gaa cag etc ttg aaa ttt gtt 2330 
Gin Glu Asp Leu Pro Lys Asp Met Leu Glu Gin Leu Leu Lys Phe Val 
720 . 725 730 735 

cct gaa aaa agt gac att gac eta ttg gag gaa cat aaa cac gaa ctg 2378 
Pro Glu Lys Ser Asp He Asp Leu Leu Glu Glu His Lys His Glu Leu 
740 745 750 

gat egg atg gec aag get gat agg ttc ctt ttt gag atg age cga att 2426 
Asp Arg Met Ala Lys Ala Asp Arg Phe Leu Phe Glu Met Ser Arg He 
755 760 765 

- aat cac tat cag caa agg ttg caa teg ctg tac ttc aaa aag aag ttt 2474 
Asn His Tyr Gin Gin Arg Leu Gin Ser Leu Tyr Phe Lys Lys Lys Phe 
770 775 780 

gca gag cgt gtg gca gaa gtg aaa cct aaa gtg gaa gca att cgt tct 2522 
Ala Glu Arg Val Ala Glu Val Lys Pro Lys Val Glu Ala He Arg Ser 
785 790 795 

ggc tea gaa gag gtg ttt agg agt ggt gee etc aag cag ttg ctg gag 2570 
Gly Ser Glu Glu Val Phe Arg Ser Gly Ala Leu Lys Gin Leu Leu Glu 
BOO 805 810 ~ 815 

gtg gtt ttg gca ttt gga aat tat atg aat aaa ggt caa aga ggg aat 2618 
Val Val Leu Ala Phe Gly Asn Tyr Met Asn Lys Gly Gin Arg Gly Asn 
820 825 830 

gca tat gga ttc aag ata tct age eta aac aaa att get gac aca aaa 2666 
Ala Tyr Gly Phe Lys He Ser Ser Leu Asn Lys He Ala Asp Thr Lys 
835 840 845 

tec age ate gac aaa aac att acc ctt ttg cac tat etc ate act att 2714 
Ser Ser He Asp Lys Asn He Thr Leu Leu His Tyr Leu He Thr He 
850 855 " 860 

gtg gaa aat aag tac ccc agt gtt etc aat eta aat gaa gaa ttg cga 2762 
Val Glu Asn Lys Tyr Pro Ser Val Leu Asn Leu Asn Glu Glu Leu Arg 
865 870 875 

gat att cct caa get gcg aaa gta aac atg act gag ctg gac aaa gaa 2810 
Asp He Pro Gin Ala Ala Lys Val Asn Met Thr Glu Leu Asp Lys Glu 
880 885 890 895 

ata agt acc ttg aga agt ggc ttg aaa gca gta gag aca gag ctg gaa 2858 
He Ser Thr Leu Arg Ser Gly Leu Lys Ala Val Glu Thr Glu Leu Glu 
900 905 910 

tat cag aag tct cag ccc cca cag ccc gga gat aag ttt gtg tct gtt 2906 
Tyr Gin Lys Ser Gin Pro Pro Gin Pro Gly Asp Lys Phe Val Ser Val 
915 920 ** 925 

gtc age cag ttc ate aca gta gee age ttc age ttc tct gat gtt gaa 2954 
Val Ser Gin Phe He Thr Val Ala Ser Phe Ser Phe Ser Asp Val Glu 
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930 



935 



940 



gac ctt eta gca gaa get aaa gac ctg ttt act aaa gca gtg aag cac 3002 
Asp Leu Leu Ala Glu Ala Lys Asp Leu Phe Thr Lys Ala Val Lys His 
945 950 955 

ttt ggg gaa gag get ggc aaa ata caa cca gat gag ttc ttt ggc att 3050 
Phe Gly Glu Glu Ala Gly Lys lie Gin Pro Asp Glu Phe Phe Gly lie 
960 965 970 975 

ttt gat caa ttt ctt caa get gtg tea gaa gee aaa caa gaa aac gaa 3098 
Phe Asp Gin Phe Leu Gin Ala Val Ser Glu Ala Lys Gin Glu Asn Glu 
980 985 990 

aat atg aga aag aaa aag gag gaa gaa gaa cgt cga get cgc atg gaa 3146 
Asn Met Arg Lys Lys Lys Glu Glu Glu Glu Arg Arg Ala Arg Met Glu 
995 1000 1005 

get cag etc aaa gaa caa cgt gaa agg gaa cgt aaa .atg aga aaa 3191 
Ala Gin Leu Lys Glu Gin Arg Glu Arg Glu Arg Lys Met Arg Lys 
1010 1015 * 1020 

get aaa gag aat agt gaa gaa age gga gag ttt gat gac ctt gtt 3236 
Ala Lys Glu Asn Ser Glu Glu Ser Gly Glu Phe Asp Asp Leu Val 
1025 1030 1035 

tea get tta cgc tea gga gaa gtg ttt gac aaa gac ctt tct aaa 3281 
Ser Ala Leu Arg Ser Gly Glu Val Phe Asp Lys Asp Leu Ser Lys 
1040 1045 1050 

ttg aaa egg aat cgc aaa cgt att ace aac cag atg act gac age 3326 
Leu Lys Arg Asn Arg Lys Arg lie Thr Asn Gin Met Thr Asp Ser 
1055 1060 1065 

age aga gag aga cca ate aca aaa ctt aat ttc taattttcca 3369 
Ser Arg Glu Arg Pro lie Thr Lys Leu Asn Phe 
1070 1075 



tgaatacttt 


tttttagaaa 


gctcattagc 


agccctctaa 


agtgactaga 


aegtttcatt 


3429 


acactgcctt 


gcaatccaaa 


cagtggcaat 


tttttccttc 


atctgtgagt 


gaatgtgtga 


3489 


acgtgtgtat 


gtaaatgtat 


gtgtgtatat 


attaaaaaat 


gtatatagat 


gtctgagtgt 


3549 


tgtctggaga 


ectataegta 


tggttaaaaa 


gatttatgtt 


aatgtatgtg 


ctccaaaacc 


3609 


tttcgtgtat 


gcattcacat 


tgagtgtggc 


tcattttctt 


tccccgaacg 


ccatgactgt 


3669 


tcagaagcac 


aatactatct 


cctgaaagag 


ataagagaca 


ttccctagat 


tcaaaggcaa 


3729 


aacagaagaa 


acaaacaaac 


aaacaaacaa 


agettgeaaa 


atattttatg 


gtttccaagc 


3789 


ttgatatcct 


ttaaaattat 


tttcattgat 


ggaactggag 


ttgttggaaa 


aacatagatt 


3849 


taaaatgatt 


tttgatagct 


gacattgtga 


tgttgatgta 


tcacatcagt 


aataggacca 


3909 


gctttgaatt 


tctgacattg 


gtgtggggat 


acagtctgta 


aatgtttatt 


gagaacatct 


3969 


tgcacacaat 


ttgaattatg 


tagaatgtca 


atcaagtttt 


tgtatattta 


aaagttggac 


4029 


atcaattttt 


tcccctgatt 


tcatcaagtt 


atctctgcca 


agtgctcttg 


ataatttctt 


4089 


cagatttttg 


gaaaaaaaca 


ctatataaat 


gcaatccatg 


ctttttttaa 


agaacaacat 


4149 


tgccagagta 


tgcttgttct 


aacaatatag 


atatataaac 


cttaaaaata 


ataaaatatc 


4209 


tcacccaaga 


cttaaaggaa 


gaattctctg 


aagggataaa 


gattact 




4256 
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<210> 


66 


<211> 


1078 


<212> 


PRT 


<213> 


NMJ)14 992 DAAM1 


<400> 


66 



Met Ala Pro Arg Lys Arg Gly Gly Arg Gly He Ser Phe He Phe Cys 
15 10 15 

Cys Phe Arg Asn Asn Asp His Pro Glu He Thr Tyr Arg Leu Arg Asn 
20 25 30 

Asp Ser Asn Phe Ala Leu Gin Thr Met Glu Pro Ala Leu Pro Met Pro 
35 40 45 

Pro Val Glu Glu Leu Asp Val Met Phe ;Ser Glu Leu Val Asp Glu Leu 
50 55 60 
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Asp Leu Thr Asp Lys His Arg Glu Ala Met Phe Ala Leu Pro Ala Glu 
fi 5 70 75 80 



Lys Lys Trp Gin He Tyr Cys Ser Lys Lys Lys Asp Gin Glu Glu Asn 
85 90 95 



Lys Gly Ala Thr Ser Trp Pro Glu Phe Tyr He Asp Gin Leu Asn Ser 
100 105 HO 



Met Ala Ala Arg Lys Ser Leu Leu Ala Leu Glu Lys Glu Glu Glu Glu 
115 120 125 



Glu Arg Ser Lys Thr lie Glu Ser Leu Lys Thr Ala Leu Arg Thr Lys 
130 135 140 



Pro Met Arg Phe Val Thr Arg Phe He Asp Leu Asp Gly Leu Ser Cys 
14 $ 150 155 160 



He Leu Asn Phe Leu Lys Thr Met Asp Tyr Glu Thr Ser Glu Ser Arg 
165 170 175 



He His Thr Ser Leu He Gly Cys He Lys Ala Leu Met Asn Asn Ser 
180 185 190 



Gin Gly Arg Ala His Val Leu Ala His Ser Glu Ser He Asn Val He 
195 200 205 



Ala Gin Ser Leu Ser Thr Glu Asn He Lys Thr Lys Val Ala Val Leu 
210 215 220 
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Glu He Leu Gly Ala Val Cys Leu Val Pro Gly Gly His Lys Lys Val 
225 230 235 240 

Leu Gin Ala Met Leu His Tyr Gin Lys Tyr Ala. Ser Glu Arg Thr Arg 
245 250 255 

Phe Gin Thr Leu He Asn Asp Leu Asp Lys Ser Thr Gly Arg Tyr Arg 
260 265 270 

Asp Glu Val Ser Leu Lys Thr Ala He Met Ser Phe He Asn Ala Val 
2 75 . 280 285 

Leu Ser Gin Gly Ala Gly Val Glu Ser. Leu Asp Phe Arg Leu His Leu 
290 295 300 



Arg Tyr Glu Phe Leu Met Leu Gly He Gin Pro Val He Asp Lys Leu 
305 310 315 320 



Arg Glu His Glu Asn Ser Thr Leu Asp Arg His Leu Asp Phe Phe Glu 
325 330 335 



Met Leu Arg Asn Glu Asp Glu Leu Glu Phe Ala Lys Arg Phe Glu Leu 
340 345 350 



Val His He Asp Thr Lys Ser Ala Thr Gin Met Phe Glu Leu Thr Arq 
355 .360 365 

Lys Arg Leu Thr His Ser Glu Ala Tyr Pro His Phe Met Ser He Leu 
370 375 380 

His His Cys Leu Gin Met Pro Tyr Lys Arg Ser Gly Asn Thr Val Gin 
385 390 395 400 

Tyr Trp Leu Leu Leu Asp Arg He He Gin Gin He Val He Gin Asn 
405 410 415 



Asp Lys Gly Gin Asp Pro Asp Ser Thr Pro Leu Glu Asn Phe Asn He 
420 425 430 



Lys Asn Val Val Arg Met Leu Val Asn Glu Asn Glu Val Lys Gin Trp 
435 440 445 



Lys Glu Gin Ala Glu Lys Met Arg Lys Glu His Asn Glu Leu Gin Gin 
450 .455 460 



Lys Leu Glu Lys Lys Glu Arg Glu Cys Asp Ala Lys Thr Gin Glu Lys 
465 470 475 480 



Glu Glu Met Met Gin Thr Leu Asn Lys Met Lys Glu Lys Leu Glu Lys 
485 490 ^ 495 
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Glu Thr Thr Glu His Lys Gin Val. Lys Gin Gin Val Ala Asp Leu Thr 
500 505 510 



Ala Gin Leu His Glu Leu Ser Arg Arg Ala Val Cys Ala Ser He Pro 
515 520 525 



Gly Gly Pro Ser Pro Gly Ala Pro Gly Gly Pro Phe Pro Ser Ser Val 
530 535 540 



Pro Gly Ser Leu Leu Pro Pro Pro Pro Pro Pro Pro Leu Pro Gly Gly 
545 550 555 560 



Met Leu Pro Pro Pro Pro Pro Pro Leu Pro Pro Gly Gly Pro Pro Pro 
565 570 ~ 575 



Pro Pro Gly Pro Pro Pro Leu Gly Ala He Met Pro Pro Pro Gly Ala 
580 585 590 



Pro Met Gly Leu Ala Leu Lys Lys Lys Ser He Pro Gin Pro Thr Asn 
595 600 605 



Ala Leu Lys Ser Phe Asn Trp Ser Lys Leu Pro Glu Asn Lys Leu Glu 
610 615 620 



Gly Thr Val Trp Thr Glu He Asp Asp Thr Lys Val Phe Lys He Leu 
625 630 635 640 



Asp Leu Glu Asp Leu Glu Arg Thr Phe Ser Ala Tyr Gin Arg Gin Gin 
645 650 ' 655 



Asp Phe Phe Val Asn Ser Asn Ser Lys Gin Lys Glu Ala Asp Ala He 
660 665 " 670 



Asp Asp Thr Leu Ser Ser Lys Leu Lys Val Lys Glu Leu Ser Val He 
675 680 685 



Asp Gly Arg Arg Ala Gin Asn Cys Asn He Leu Leu Ser Arg Leu Lys 
690 695 700 



Leu Ser Asn Asp Glu He Lys Arg Ala He Leu Thr Met Asp Glu Gin 
™ 5 710 715 720 



Glu Asp Leu Pro Lys Asp Met Leu Glu Gin Leu Leu Lys Phe Val Pro 
725 730 ' 735 



Glu Lys Ser Asp He Asp Leu Leu Glu Glu His Lys His Glu Leu Asp 
740 745 750 



Arg Met Ala Lys Ala Asp Arg Phe Leu Phe Glu Met Ser Arg He Asn 
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755 760 765 



His Tyr Gin Gin Arg Leu Gin Ser Leu Tyr Phe Lys Lys Lys Phe Ala 
770 775 ~ 780 



Glu Arg Val Ala Glu Val Lys Pro Lys Val Glu Ala lie Arg Ser Gly 
785 790 795 800 



Ser Glu Glu Val Phe Arg Ser Gly Ala Leu Lys Gin Leu Leu Glu Val 
805 810 815 



Val Leu Ala Phe Gly Asn Tyr Met Asn Lys Gly Gin Arg Gly Asn Ala 
820 825 830 



Tyr Gly Phe Lys He Ser Ser Leu Asn. Lys. He Ala Asp Thr Lys Ser 
835 840 845 



Ser He Asp Lys Asn He Thr Leu Leu His Tyr Leu He Thr He Val 
850 855 860 



Glu Asn Lys Tyr Pro Ser Val Leu Asn Leu Asn Glu Glu Leu Arg Asp 
865 870 875 880 



He Pro Gin Ala Ala Lys Val Asn Met Thr Glu Leu Asp Lys Glu He 
885 890 895 



Ser Thr Leu Arg Ser Gly Leu Lys Ala Vai Glu Thr Glu Leu Glu Tyr 
900 905 910 



Gin Lys Ser Gin Pro Pro Gin Pro Gly Asp Lys Phe Val Ser Val Val 
915 920 925 



Ser Gin Phe He Thr Val Ala Ser Phe Ser Phe Ser Asp Val Glu Asp 
930 935 940 



Leu Leu Ala Glu Ala Lys Asp Leu Phe Thr Lys Ala Val Lys His Phe 
945 950 955 960 



Gly Glu Glu Ala Gly Lys He Gin Pro Asp Glu Phe Phe Gly He Phe 
965 970 975 



Asp Gin Phe Leu Gin Ala Val Ser Glu Ala Lys Gin Glu Asn Glu Asn 
980 985 * 990 



Met Arg Lys Lys Lys Glu Glu Glu Glu Arg Arg Ala Arg Met Glu Ala 
995 1000 1005 



Gin Leu Lys Glu Gin Arg Glu Arg Glu Arg Lys Met Arg Lys Ala 
1010 1015 1020 



Lys Glu Asn Ser Glu Glu Ser * Gly Glu Phe Asp Asp Leu Val Ser 
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1025 1030 



1035 



Ala Leu Arg Ser Gly Glu Val Phe Asp Lys Asp Leu Ser Lys Leu 
!040 1045 1050 

Lys Arg Asn Arg Lys Arg lie Thr Ash Gin Met Thr Asp Ser Ser 
1Q 55 1060 1065 

Arg Glu Arg Pro lie Thr Lys Leu Asn Phe 
1070 1075 

<210> 67 • 

<211> 1096 

<212> DNA 

<213> NM_002122 MHC class II DQ alpha 1/ HLA-DQA1 

<220> 
<221> CDS 
<222> <1)..<765) 
<223> 

<400> 67 

atg ate eta aac aaa get ctg ctg ctg ggg gee etc get ctg ace ace 
Met lie Leu Asn Lys Ala Leu Leu Leu Gly Ala Leu Ala Leu Thr Thr 
1 ? 10 15 

gtg atg age ccc tgt gga ggt gaa gac att gtg get gac cac gtt gee 
Val Met Ser Pro Cys Gly Gly Glu Asp He Val Ala Asp His Val Ala 
20 25 30 

tct tgt ggt gta aac ttg tac cag ttt tac ggt ccc tct ggc cag tac 144 
Ser Cys Gly Val Asn Leu Tyr Gin Phe Tyr Gly Pro Ser Gly Gin Tyr 
35 40 45 

acc cat gaa ttt gat gga gat gag cag ttc tac gtg gac ctg gag agg 192 
Thr His Glu Phe Asp Gly Asp Glu Gin Phe Tyr Val Asp Leu Glu Arg 
50 55 60 

aag gag act gee tgg egg tgg cct gag ttc age aaa ttt gga ggt ttt 240 
Lys Glu Thr Ala Trp Arg Trp Pro Glu Phe Ser Lys Phe Gly Gly Phe 
65 70 75 80 

gac ccg cag ggt gca ctg aga aac atg get gtg gca aaa cac aac ttg 288 
Asp Pro Gin Gly Ala Leu Arg Asn Met Ala Val Ala Lys His Asn Leu 
85 90 95 

aac ate atg att aaa cgc tac aac tct acc get get acc aat gag gtt 336 
Asn lie Met He Lys Arg Tyr Asn Ser Thr Ala Ala Thr Asn Glu Val 
100 105 no 

cct gag gtc aca gtg ttt tec aag tct ccc gtg aca ctg ggt cag ccc 384 
Pro Glu Val Thr Val Phe Ser Lys Ser- Pro Val Thr Leu Gly Gin Pro 
115 120 125 



48 



96 
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aac acc etc att tgt ctt gtg gac aac ate ttt cct cct gtg gtc aac 
Asn Thr Leu He Cys Leu Val Asp Asn lie Phe Pro Pro Val Val Asn 
130 135 140 



ate aca tgg ctg age aat ggg cag tea gtc aca gaa ggt gtt tct gag 
He Thr Trp Leu Ser Asn Gly Gin Ser Val Thr Glu Gly Val Ser Glu 
145 150 155 i 60 



<210> 


68 


<211> 


255 


<212> 


PRT 


<213> 


nmj: 


<400> 


68 



Met He Leu Asn Lys Ala Leu Leu Leu Gly Ala Leu Ala Leu Thr Thr 
1 5 10 15 

Val Met Ser Pro Cys Gly Gly Glu Asp He Val Ala Asp His Val Ala 
20 25 30 



432 



480 



acc age ttc etc tec aag agt gat cat tec ttc ttc aag ate agt tac 528 
Thr Ser Phe Leu Ser Lys Ser Asp His Ser Phe Phe Lys He Ser Tyr 
165 170 175 

etc acc ttc etc cct tct get gat gag att tat gac tgc aag gtg gag 576 
Leu Thr Phe Leu Pro Ser Ala Asp Glu He Tyr Asp Cys Lys Val Glu 
180 185 190 

cac tgg ggc ctg gac cag cct ctt ctg aaa cac tgg gag cct gag att 
His Trp Gly Leu Asp Gin Pro Leu Leu Lys His Trp Glu Pro Glu He 
195 200 205 

cca gee cct atg tea gag etc aca gag act gtg gtc tgt gec ctg ggg 
Pro Ala Pro Met Ser Glu Leu Thr Glu Thr Val Val Cys Ala Leu Gly 
210 215 220- 

ttg tct gtg ggc etc atg ggc att gtg gtg ggc act gtc ttc ate ate 
Leu Ser Val Gly Leu Met Gly He Val Val Gly Thr Val Phe He He 
225 230 235 240 

caa ggc ctg cgt tea gtt ggt get tec aga cac caa ggg cca ttg 765 
Gin Gly Leu Arg Ser Val Gly Ala Ser Arg His Gin Gly Pro Leu 
245 250 255 

tgaatcccat cctggaaggg aaggtgcatc gccatctaca ggagcagaag aatggacttg 825 

ctaaatgacc tagcactatt ctctggcccg atttatcata tcccttttct cctccaaata 885 

tttctcctct caccttttct ctgggactta agetgetata tcccctcaga gctcacaaat 945 

gectttacat tctttccctg acctcctgat tttttttttc ttttctcaaa tgttacctac 1005 

aatacatgee tggggtaagc cacccggcta cctaattcct cagtaacctc catctaaaat 1065 

ctccaaggaa gcaataaatt ccttttatga g i096 



624 



672 



720 



Ser Cys Gly Val Asn Leu Tyr Gin Phe Tyr Gly Pro Ser Gly Gin Tyr 
35 40 45 
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Thr His Glu Phe Asp Gly Asp Glu Gin Phe Tyr Val Asp Leu Glu Arg 
50 55 go 

Lys Glu Thr Ala Trp Arg Trp Pro Glu Phe Ser Lys Phe Gly Gly Phe 
bS 70 75 80 

Asp Pro Gin Gly Ala Leu Arg Asn Met Ala Val Ala Lys His Asn Leu 
85 90 95 

Asn He Met He Lys Arg Tyr Asn Ser Thr Ala Ala Thr Asn Glu Val 
100 105 no 

Pro Glu Val Thr Val Phe Ser Lys Ser Pro Val Thr Leu Gly Gin Pro 
115 120 125 

Asn Thr Leu lie Cys Leu Val Asp Asn lie Phe Pro Pro Val Val Asn 
1JU 135 140 

lie Thr Trp Leu Ser Asn Gly Gin Ser Val Thr Glu Gly Val Ser Glu 
45 150 ' 155 i 6 o 

Thr Ser Phe Leu Ser Lys Ser Asp His Ser Phe Phe Lys He Ser Tyr 
165 170 175 

Leu Thr Phe Leu Pro Ser Ala Asp Glu lie Tyr Asp Cys Lys Val Glu 
180 185 i 9 o 

His Trp Gly Leu Asp Gin Pro Leu Leu Lys His Trp Glu Pro Glu He 
195 200 205 

Pro Ala Pro Met Ser Glu Leu Thr Glu Thr Val Val Cys Ala Leu Gly 

215 220 

Leu Ser Val Gly Leu Met Gly He Val Val Gly Thr Val Phe He He 
230 235 240 

Gin Gly Leu Arg Ser Val Gly Ala Ser Arg His Gin Gly Pro Leu 
245 250 " 255 

<210> 69 
<211> 2820 
<212> DNA 

<213> NM_003014 SFRP4, secreted frizzled-related protein 4 



<220> 

<221> CDS 



* > ) 
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<222> (238) (1275) 
<223> 



<400> 69 

ggcgggttcg cgccccgaag gctgagagct ggcgctgctc gtgccctgtg tgccagacgg 60 

cggagctccg cggccggacc ccgcggcccc gctttgctgc cgactggagt ttgggggaag 120 

aaactctcct gcgccccaga agatttcttc ctcggcgaag ggacagcgaa agatgagggt 180 

ggcaggaaga gaaggcgctt tctgtctgcc ggggtcgcag cgcgagaggg cagtgcc 237 

atg ttc etc tec ate eta gtg gcg ctg tgc ctg tgg ctg cac ctg gcg 285 
Met Phe Leu Ser He Leu Val Ala Leu Cys Leu Trp Leu His Leu Ala 
1 5 10 15 

ctg ggc gtg cgc ggc gcg ccc tgc gag gcg gtg cgc ate cct atg tgc 333 
Leu Gly Val Arg Gly Ala Pro Cys Glu Ala Val Arg He Pro Met Cys 
20 25 30 

egg cac atg ccc tgg aac ate acg egg atg ccc aac cac ctg cac cac 381 
Arg Hxs Met Pro Trp Asn He Thr Arg Met Pro Asn His Leu His His 
35 40 45 

age acg cag gag aac gec ate ctg gec ate gag cag tac gag gag ctg 429 
Ser Thr Gin Glu Asn Ala He Leu Ala He Glu Gin Tyr Glu Glu Leu 
50 '55 60 

gtg gac gtg aac tgc age gec gtg ctg cgc ttc ttc ttc tgt gec atg 477 
Val Asp Val Asn Cys Ser Ala Val Leu Arg Phe Phe Phe Cys Ala Met 
65 70 75 80 

tac gcg ccc att tgc acc ctg gag ttc ctg cac gac cct ate aag ccg 525 
Tyr Ala Pro He Cys Thr Leu Glu Phe Leu His Asp Pro He Lys Pro 
85 90 95 

tgc aag teg gtg tgc caa cgc gcg cgc gac gac tgc gag ccc etc atg 573 
Cys Lys Ser Val Cys Gin Arg Ala Arg Asp Asp Cys Glu Pro Leu Met 
100 105 no 

aag atg tac aac cac age tgg ccc gaa age ctg gee tgc gac gag ctg 621 
Lys Met Tyr Asn His Ser Trp Pro Glu Ser Leu Ala Cys Asp Glu Leu 
115 120 125 



cct gtc tat gac cgt ggc gtg tgc att teg cct gaa gec ate gtc acg 
Pro Val Tyr Asp Arg Gly Val Cys He Ser Pro Glu Ala He Val Thr 
130 135 140 



A « 200 205 

agt ggc tgc aat gag gtc aca acg gtg gtg gat gta aaa gag ate ttc 



669 



gac etc ccg gag gat gtt aag tgg ata gac ate aca cca gac atg atg 717 
Asp Leu Pro Glu Asp Val Lys Trp He Asp He Thr Pro Asp Met Met 
145 150 155 160 

gta cag gaa agg cct ctt gat gtt gac tgt aaa cgc eta age ccc gat 765 
Val Gin Glu Arg Pro Leu Asp Val Asp Cys Lys Arg Leu Ser Pro Asp 
165 170 175 

egg tgc aag tgt aaa aag gtg aag cca act ttg gca acg tat etc age 813 
Arg Cys Lys Cys Lys Lys Val Lys Pro Thr Leu Ala Thr Tyr Leu Ser 
18° 185 190 

aaa aac tac age tat gtt att cat gec aaa ata aaa get gtg cag agg 861 
Lys Asn Tyr Ser Tyr Val He His Ala Lys He Lys Ala Val Gin Arg 
195 200 205 



909 
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957 



1005 



1053 



1101 



1197 



1245 



1295 



Ser Gly Cys Asn Glu Val Thr Thr Val Val Asp Val Lys Glu He Phe 
210 215 • 220 

aag tec tea tea ccc. ate cct cga act caa gtc ccg etc att aca aat 
Lys Ser Ser Ser Pro He Pro Arg Thr Gin Val Pro Leu He Thr Asn 
225 230 235 . 240 

tct tct tgc cag tgt cca cac ate ctg ccc cat caa gat gtt etc ate 
Ser Ser Cys Gin Cys Pro His He Leu Pro His Gin Asp Val Leu He 
245 250 255 

atg tgt tac gag tgg cgt tea agg. atg atg ctt ctt gaa aat tgc tta 
Met Cys Tyr Glu Trp Arg Ser Arg Met Met Leu Leu Glu Asn Cys Leu 
260 265 270 

gtt gaa aaa tgg aga gat cag ctt agt aaa aga tec ata cag tgg gaa 
Val Glu Lys Trp Arg Asp Gin Leu Ser Lys Arg Ser He Gin Trp Glu 
275 280 285 

gag agg ctg cag gaa cag egg aga aca gtt cag gac aag aag aaa aca 1149 
Glu Arg Leu Gin Glu Gin Arg Arg Thr Val Gin Asp Lys Lys Lys Thr 
290 . 295 * 300 

gee ggg cgc ace agt cgt agt aat ccc ccc aaa cca aag gga aag cct 
Ala Gly Arg Thr Ser Arg Ser Asn Pro Pro Lys Pro Lys Gly Lys Pro 
305 310 315 320 

cct get ccc aaa cca gee agt ccc aag aag aac att aaa act agg agt 
Pro Ala Pro Lys Pro Ala Ser Pro Lys Lys Asn He Lys Thr Arg Ser 
325 330 335 

gee cag aag aga aca aac ccg aaa aga gtg tgagctaact agtttccaaa 
Ala Gin Lys Arg Thr Asn Pro Lys Arg Val 
340 345 

geggagaett ccgacttcct tacaggatga ggctgggcat tgcctgggac agcctatgta 1355 

aggccatgtg ccccttgccc taacaactca ctgcagtgct cttcatagac acatcttgea 1415 

gcatttttct taaggctatg cttcagtttt tctttgtaag ccatcacaag ccatagtggt 1475 

aggtttgece tttggtacag aaggtgagtt • aaagctggtg gaaaaggctt attgeattge 1535 

attcagagta acctgtgtgc atactctaga agagtaggga aaataatget tgttacaatt 1595 

cgacctaata tgtgcattgt aaaataaatg ccatatttca aacaaaacac gtaatttttt 1655 

tacagtatgt tttattacct tttgatatct gttgttgcaa tgttagtgat gttttaaaat 1715 

gtgatgaaaa tataatgttt ttaagaagga acagtagtgg aatgaatgtt aaaagatctt 1775 

tatgtgttta tggtctgcag aaggattttt gtgatgaaag gggatttttt gaaaaattag 1835 

agaagtagca tatggaaaat tataatgtgt ttttttacca atgacttcag tttctgtttt 1895 

tagctagaaa cttaaaaaca aaaataataa taaagaaaaa taaataaaaa ggagaggcag 1955 

acaatgtctg gattcctgtt ttttggttac ctgatttcca tgatcatgat gcttcttgtc 2015 

aacaccctct taagcagcac cagaaacagt gagtttgtct gtaccattag gagttaggta 2075 

ctaattagtt ggctaatget caagtatttt atacccacaa gagaggtatg tcactcatct 2135 

tacttcccag gacatccacc ctgagaataa tttgacaagc ttaaaaatgg ccttcatgtg 2195 

agtgccaaat tttgtttttc ttcatttaaa tattttcttt gectaaatae atgtgagagg 2255 

agttaaatat aaatgtacag agaggaaagt tgagttccac ctctgaaatg agaattactt 2315 
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gacagttggg atactttaat cagaaaaaaa gaacttattt gcagoatttt atcaacaaat 2375 

ttcataattg tggacaattg gaggcattta ttttaaaaaa caattttatt ggccttttgc 2435 

taacacagta agcatgtatt ttataaggca ttcaataaat gcacaacgcc caaaggaaat 2495 

aaaatcctat ctaatcctac tctccactac aoagaggtaa tcactattag tattttggca 2555 

tattattctc caggtgtttg cttatgcact tataaaatga tttgaacaaa taaaactagg 2615 

aacctgtata catgtgtttc ataacctgcc tcctttgctt ggcqctttat tgagataagt 2675 

tttcctgtca agaaagcaga aaccatctca tttctaacag ctgtgttata ttccatagta 2735 

tgcattactc aacaaactgt tgtgctattg gatacttagg tggtttcttc actgacaata 2795 
ctgaataaac atctcaccgg aattc 



<210> 70 
<211> 346 
<212> PRT 

<213> NM_003014 SFRP4, secreted f rizzled-related protein 4 
<400> 70 

Met Phe Leu Ser lie Leu Val Ala Leu Cys Leu Trp Leu His Leu Ala 



10 



15 



Leu Gly val Arg Gly Ala Pro Cys Glu Ala Val Arg He Pro Met Cys 

25 3q 

Arg His Met Pro Trp Asn He Thr Arg Met- Pro Asn His Leu His His 

40 45 

Ser Thr Gin Glu Asn Ala He Leu Ala lie Glu Gin Tyr Glu Glu Leu 

55 60 

Val Asp Val Asn Cys Ser Ala Val Leu Arg Phe Phe Phe Cys Ala Met 
70 ' 75 go 

Tyr Ala Pro lie Cys Thr Leu Glu Phe Leu His Asp Pro He Lys Pro 
85 90 95 

Cys Lys Ser yal Cys Gin Arg Ala Arg Asp Asp Cys Glu Pro Leu Met 
i0U 105 110 

I*s Met Tyr Asn His Ser Trp Pro Glu Ser Leu Ala Cys Asp Glu Leu 
1J - 3 120 12 s 

Pro yal Tyr Asp Arg Gly Val Cys He Ser Pro Glu Ala He Val Thr 

135 140 



2820 
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Asp Leu Pro Glu Asp Val Lys Trp He Asp lie Thr Pro Asp Met Met 
145 150 155 leo 

Val Gin Glu Arg Pro Leu Asp Val Asp Cys Lys Arg Leu Ser Pro Asp 
165 170 175 

Arg Cys Lys Cys Lys Lys Val Lys Pro Thr Leu Ala Thr Tyr Leu Ser 
x ?° ^ 185 190 

Lys Asn Tyr Ser Tyr Val He His Ala Lys lie Lys Ala Val Gin Arg 
195 200 205 

Ser Gly Cys Asn Glu Val Thr Thr Val Val Asp Val Lys Glu lie Phe 
210 215 220 



Lys Ser Ser Ser Pro He Pro Arg Thr Gin Val Pro Leu He Thr 



225 230 



Asn 



235 240 



Ser Ser Cys Gin Cys Pro His He Leu Pro His Gin Asp Val Leu He 
245 250 255 

Met Cys Tyr Glu Trp Arg Ser Arg Met Met Leu Leu Glu Asn Cys Leu 
260 265 270 

Val Glu Lys Trp Arg Asp Gin Leu Ser Lys Arg Ser He Gin Trp Glu 
z ' h 280 285 

Glu Arg Leu Gin Glu Gin Arg Arg Thr Val Gin Asp Lys Lys Lys Thr 
290 295 300 

Ala Gly Arg Thr Ser Arg Ser Asn Pro Pro Lys Pro Lys Gly Lys Pro 
310 315 320 

Pro Ala Pro Lys Pro Ala Ser Pro Lys Lys Asn He Lys Thr Arg Ser 
325 330 335 

Ala Gin Lys Arg Thr Asn Pro Lys Arg Val 
340 345 

<210> 71 
<2U> 1362 
<212> DNA 

<213> NM_004039 annexin A2 



<220> 
<221> 
<222> 



CDS 

(50).. (1066) 
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<223> 



<400> 71 

catttgggga cgctctcagc tctcggcgca cggcccagct tccttcaaa atg tct act 58 

Met Ser Thr 
1 

gtt cac gaa ate ctg tgc aag etc age ttg gag ggt gat cac tct aca 106 
vai His Glu He Leu Cys Lys Leu Ser Leu Glu Gly Asp His Ser Thr 
5 10 15 

ccc cca agt gca tat ggg tct gtc aaa gee tat act aac ttt gat get 154 
Pro Pro Ser Ala Tyr Gly Ser Val Lys Ala Tyr Thr. Asn Phe Asp Ala 
20 25 30 35 

gag egg gat get ttg aaa att gaa aca gec ate aag ace aaa ggt gtg 202 
Glu Arg Asp Ala Leu Asn He Glu Thr Ala He Lys Thr Lys Gly Val 
4 ° 45 50 

gat gag gtc ace att gtc aac att ttg ace aac cgc age aat gca cag 250 
Asp Glu Val Thr He Val Asn He Leu Thr Asn Arg Ser Asn Ala Gin 
55 60 65 

aga cag gat att gec ttc gee tac cag aga agg ace aaa aag gaa ctt 298 
Arg Gin Asp lie Ala Phe Ala Tyr Gin Arg Arg Thr Lys Lys Glu Leu 
70 75 80 

gca tea gca- ctg aag tea gec tta tct ggc cac ctg gag acg gtg att 346 
Ala Ser Ala Leu Lys Ser Ala Leu Ser Gly His Leu Glu Thr Val He 
85 90 95 

ttg ggc eta ttg aag aca cct get cag tat gac get tct gag eta aaa 394 
Leu Gly Leu Leu Lys Thr Pro Ala Gin Tyr Asp Ala Ser Glu Leu Lys 
100 105 no 1 J 5 

get tec atg aag ggg ctg gga acc gac gag gac tct etc att gag ate 442 
Ala Ser Met Lys Gly Leu Gly Thr Asp Glu Asp Ser Leu He Glu He 
120 125 . 130 

ate tgc tec aga acc aac cag gag ctg cag gaa attr aac aga gtc tac 490 
He Cys Ser Arg Thr Asn Gin Glu Leu Gin Glu He Asn Arg Val Tyr 
135 140 - 145 

aag gaa atg tac aag act gat ctg gag aag gac att att teg gac aca 538 
Lys Glu Met Tyr Lys Thr Asp Leu Glu Lys Asp He He Ser Asp Thr 
150 155 160 

til r° »u C * 9 ° aag Ctg atg St* gcc ct * 9 ca aa S aga aga 586 
Ser Gly Asp Phe Arg Lys Leu Met Val Ala Leu Ala Lys Gly Arg Arg 
165 170 175 

gca gag gat ggc tct gtc att gat tat gaa ctg att gac caa gat get 634 
Ala Glu Asp Gly Ser Val He Asp Tyr Glu Leu He Asp Gin Asp Ala 
180 185 190 195 

egg gat etc tat gac get gga gtg aag agg aaa gga act gat gtt ccc 682 
Arg Asp Leu Tyr Asp Ala Gly Val Lys Arg Lys Gly Thr Asp Val Pro 
200 205 210 

aag tgg ate age ate atg acc gag egg age gtg ccc cac etc cag aaa 730 
Lys Trp He Ser He Met Thr Glu Arg Ser Val Pro His Leu Gin Lys 
215 220 225 

gta ttt gat agg tac aag agt tac age cct tat gac atg ttg gaa age 778 
Val Phe Asp Arg Tyr Lys Ser Tyr Ser Pro Tyr Asp Met Leu Glu Ser 
230 235 240 
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ate agg aaa gag gtt aaa gga gac ctg gaa aat get ttc ctg aac ctg 626 
lie Arg Lys Glu Val Lys Gly Asp Leu Glu Asn Ala Phe Leu Asn Leu 
245 250 .255 

Si Gin rlt % g CC ° Ctg tat ttt gCt gat ct 9 tat 874 

Val Gin Cys He Gin Asn Lys Pro Leu Tyr Phe Ala Asp Arg Leu Tyr 

260 265 270 275 

gac tec atg aag ggc aag ggg acg c'ga gat aag gtc ctg ate aga ate 922 
Asp Ser Met Lys Gly Lys Gly Thr Arg Asp LyI Val Leu He Arg lie 
280 285 290 

tit ^ o CC f 9C &9t 9aa gtg gac atg ttg aaa att a 99 tot gaa ttc 970 
Met Val Ser Arg Ser Glu Val Asp Met Leu Lys lie Arg Ser Glu Phe 

295 300 . 305 

III tin tII 2? C f 39 tC ° Ctg taC tat tat atc oag caa 9 ac a <* 1018 
Lys Arg Lys Tyr Gly Lys Ser Leu Tyr Tyr Tyr He Gin Gin Asp Thr 

310 315 320 

aag ggc gac tac cag aaa gcg ctg ctg tac ctg tgt ggt gga gat gac 1066 
Lys Gly Asp Tyr Gin Lys Ala Leu Leu Tyr Leu Cys Ily lly Asp Isp 
J " 330 335 

tgaagecega cacggcctga gcgtceagaa atggtgetca ccatgcttcc agctaacagg 1126 

tctagaaaac cagcttgega ataacagtcc ccgtggccat ccctgtgagg gtgaegttag 1186 

cattaccccc aacctcattt tagttgecta ageattgect ggccttcctg tctagtctct 124 6 

cctgtaagcc aaagaaatga acattccaag gagttggaag tgaagtctat gatgtgaaac 1306 

actttgcctc ctgtgtactg tgtcataaac agatgaataa actgaatttg- tacttt 1362 

<210> 72 

<211> 339 

<212> PRT 

<213> NM_004039 annexln A2 



<400> 72 

Met Ser Thr Val His Glu He Leu Cys Lys Leu Ser Leu Glu Gly Asp 
1 5 10-15 

His Ser Thr Pro Pro Ser Ala Tyr Gly Ser Val Lys Ala Tyr Thr Asn 
20 25 30 

Phe Asp Ala Glu Arg Asp Ala Leu Asn He Glu Thr Ala He Lys Thr 
35 40 45 

Lys Gly Val Asp Glu Val Thr He Val Asn He Leu Thr Asn Arg Ser 
50 55 60 

Asn Ala Gin Arg Gin Asp He Ala Phe Ala Tyr Gin Arg Arg Thr Lys 
65 70 75 80 
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Lys Glu- Leu Ala Ser Ala Leu Lys Ser Ala Leu Ser Gly His Leu Glu 
85 .90 95 



Thr Val lie Leu Gly Leu Leu Lys Thr Pro Ala Gin Tyr Asp Ala Ser 
100 105 HO 



Glu Leu Lys Ala Ser Met Lys Gly Leu Gly Thr Asp Glu Asp Ser Leu 
115 120 125 



He Glu He He Cys Ser Arg Thr Asn Gin Glu Leu Gin Glu He Asn 
130 135 140 



Arg Val Tyr Lys Glu Met Tyr Lys Thr Asp Leu Glu Lys Asp He 'lie 
3-45 150 . 155 ' 160 



Ser Asp Thr Ser Gly Asp Phe Arg Lys Leu Met Val Ala Leu Ala Lys 
165 170 175 



Gly Arg Arg Ala Glu Asp Gly Ser Val lie Asp Tyr Glu Leu He Asp 
180 185 190 



Gin Asp Ala Arg Asp Leu Tyr Asp Ala Gly Val Lys Arg Lys Gly Thr 
195 200 205 



Asp Val .Pro Lys Trp He Ser He Met Thr Glu Arg Ser Val Pro His 
210 215 « 220 



Leu Gin Lys Val Phe Asp Arg Tyr Lys Ser Tyr Ser Pro Tyr Asp Met 
225 230 235- ~ 240 



Leu Glu Ser He Arg Lys Glu Val Lys Gly Asp Leu Glu Asn Ala Phe 
245 250 255 



Leu Asn Leu Val Gin Cys He Gin Asn Lys Pro Leu Tyr Phe Ala Asp 
260 265 270 



Arg Leu Tyr Asp Ser Met Lys Gly Lys Gly Thr Arg Asp Lys Val Leu 
275 280 285 



He Arg He Met Val Ser Arg Ser Glu Val Asp Met Leu Lys He Arg 
290 295 300 



Ser Glu Phe Lys Arg Lys Tyr Gly Lys Ser Leu Tyr Tyr Tyr He Gin 
305 310 315 320 



Gin Asp Thr Lys Gly Asp Tyr Gin Lys Ala Leu Leu Tyr Leu Cys Gly 
325 330 335 



Gly Asp Asp 
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<210> 73 

<211> 850 

<212> DNA 

<213> NM_003955 SOCS3 
<220> 

<221> CDS 

<222> (107) (781) 

<223> 



<400> 73 

gcgccttcct ctccgcagcc ccccgggatg cggtagcggc cgctgtgcgg aggccgcgaa 60 

gcagctgcag ccgccgccgc gcagatccac gctggctccg- tgcgcc atg gtc acc 115 

Met Val Thr 
1 

cac age aag ttt ccc gec gec ggg atg age cgc ccc ctg gac acc age 163 
His Ser Lys Phe Pro Ala Ala Gly Met Ser Arg Pro Leu Asp Thr Ser 
5 10 .15 

ctg cgc etc aag acc ttc age tec aag age gag tac cag ctg gtg gtg 211 
Leu Arg Leu Lys Thr Phe Ser Ser Lys Ser. Glu Tyr Gin Leu Val Val 
20 25- -30 35 

aac gca gtg cgc aag ctg cag gag age ggc ttc tac tgg age gca gtg 259 
Asn Ala Val Arg Lys Leu Gin Glu Ser Gly Phe Tyr Trp Ser Ala Val 
40 45 50 

acc ggc ggc gag gcg aac ctg ctg etc agt gee gag ccc gee ggc acc 307 
Thr Gly Gly Glu Ala Asn Leu Leu Leu Ser Ala Glu Pro Ala Gly Thr 
55 60 65 

ttt ctg ate cgc gac age teg gac cag cgc cac ttc ttc acg etc age 355 
Phe Leu He Arg Asp Ser Ser Asp Gin Arg His Phe Phe Thr Leu Ser 
70 75 80- 

gtc aag acc cag tct ggg acc aag aac .ctg cgc ate cag tgt gag ggg 403 
Val Lys Thr Gin Ser Gly Thr Lys Asn Leu Arg He Gin Cys Glu Gly 
85 90 95 

ggc age ttc tct ctg cag age gat ccc egg age acg cag ccc gtg ccc 451 
Gly Ser Phe Ser Leu Gin Ser Asp Pro Arg Ser Thr Gin Pro Val Pro 
100 105 HO 115 

cgc ttc gac tgc gtg etc aag ctg gtg tac cac tac atg ccg ccc cct 499 
Arg Phe Asp Cys Val Leu Lys Leu Val Tyr His Tyr Met Pro Pro Pro 
120 125 130 

gga gec ccc tec ttc ccc teg cca cct act gaa ccc tec tec gag gtg 547 
Gly Ala Pro Ser Phe Pro Ser Pro Pro Thr Glu Pro Ser Ser Glu Val 
135 140 145 

ccc gag cag ccg tct gec cag cca etc cct ggg agt ccc ccc aga aga 595 
Pro Glu Gin Pro Ser Ala Gin Pro Leu Pro Gly Ser Pro Pro Arg Arg 
150 155 160 

gec tat tac ate tac tec ggg ggc gag aag ate ccc ctg gtg ttg age 643 
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Ala Tyr Tyr lie Tyr Ser Gly Gly Glu Lys lie Pro Leu Val Leu Ser 
165 170 175 

egg ccc etc tec tec aac gtg gee actctt cag cat etc tgt egg aag 691 
Arg Pro Leu Ser Ser Asn Val Ala Thr Leu Gin His Leu Cys Arg Lys 
180 185 190 " 195 

acc gtc aac ggc cac ctg gac tec tat gag aaa gtc acc cag ctg ccg 739 
Thr Val Asn Gly His Leu Asp Ser Tyr Glu Lys Val Thr Gin Leu Pro 
200 205 210 

ggg ccc att egg gag ttc ctg gac cag tac gat gee ccg ctt 781 
Gly Pro lie Arg Glu Phe Leu Asp Gin Tyr Asp Ala Pro Leu 

215 220 " 225 . 

taaggggtaa agggegcaaa gggcatgggt egggagaggg gacgcaggcc cctctcctcc 841 

gtggcacat 850 

<210> 74 

<211> 225 

<212> PRT 

<213> NM 003955 SOCS3 



<400> 74 

Met Val Thr His Ser Lys Phe Pro Ala Ala Gly Met Ser Arg Pro Leu 
15 10 15 



Asp Thr Ser Leu Arg Leu Lys Thr Phe Ser Ser Lys Ser Glu Tyr Gin 
20 25 30 



Leu Val Val Asn Ala Val Arg Lys Leu Gin Glu Ser Gly Phe Tyr Trp 
35 40 45 



Ser Ala Val Thr Gly Gly Glu Ala Asn Leu Leu Leu Ser Ala Glu Pro 
50 55 60 



Ala Gly Thr Phe Leu lie Arg Asp Ser Ser Asp Gin Arg His Phe Phe 
65 70 75 80 



Thr Leu Ser Val Lys Thr Gin Ser Gly Thr Lys Asn Leu Arg He Gin 
85 90 95 



Cys Glu Gly Gly Ser Phe Ser Leu Gin Ser Asp Pro Arg Ser Thr Gin 
100 105 110 



Pro Val Pro Arg Phe Asp Cys Val Leu Lys Leu Val Tyr His Tyr Met 
115 120 125 



Pro Pro Pro Gly Ala Pro Ser Phe Pro Ser Pro Pro Thr Glu Pro Ser 
130 135 140 
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Ser Glu Val Pro Glu Gin Pro Ser Ala Gin Pro Leu Pro Gly Ser Pro 
145 150 • 155 160 

Pro Arg Arg Ala Tyr Tyr He Tyr Ser Gly Gly Glu Lys He Pro Leu 
165 170 175 

Val Leu Ser Arg Pro Leu Ser Ser Asn Val Ala Thr Leu Gin His Leu 
180 185 190 

Cys Arg Lys Thr Val Asn. Gly His Leu Asp Ser Tyr Glu Lys Val Thr 
195 200 205 

Gin Leu Pro Gly Pro He Arg Glu Phe Leu Asp Gin Tyr Asp Ala Pro 
210 215 220 

Leu 
225 

<210> 75 
<211> 369 
<212> DNA 

<213> NM_000331 SAA1, serum amyloid Al 



<220> 

<221> CDS 

<222> <1)..(366) 

<223> 



<400> 75 

atg aag ctt etc acg ggc ctg gtt ttc tgc tec ttg gtc ctg ggt gtc 48 

Met Lys Leu Leu Thr Gly Leu Val Phe Cys Ser Leu Val Leu Gly Val 
1 5 10 15 

age age cga age ttc ttt teg ttc ctt ggc gag get ttt gat ggg get 96 
Ser Ser Arg Ser Phe Phe Ser Phe Leu Gly Glu Ala Phe Is P lly Ala 
20 25 30 

egg gac atg tgg aga gec tac tct gac .atg aga gaa gec aat tac ate 144 
Arg Asp Met Trp Arg Ala Tyr Ser Asp Met Arg Glu Ala Asn Tyr lie 
35 40 45 

ggc tea gac aaa tac ttc cat get egg ggg aac tat gat get gec aaa 192 
Gly Ser Asp Lys Tyr Phe His Ala Arg Gly Asn Tyr Lp ida La Lys 
50 55 60 

agg gga cct ggg ggt gtc tgg get gca gaa gcg ate age gat gec aga 240 
Arg Gly Pro Gly Gly Val Trp Ala Ala Glu Ala He Ser Asp Ala Arg 
65 ™ 75 80 

??n lit n^ 9 f ga ^ C ttfc ggC Cat ggt g °9 ga 9 9ac teg ctg get 288 
Glu Asn He Gin Arg Phe Phe Gly His Gly Ala Glu Asp Ser Leu Ala 
85 90 95 
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gat cag get 


gec aat 


gaa 


Asp Gin Ala 


Ala Asn 


Glu 




100 




ttc cga cct 


get ggc 


ctg 


Phe Arg Pro 


Ala Gly 


Leu 


115 







' yy<- agg agt ggc aaa gac ccc aat cac 336 

> Gly Arg Ser Gly Lys Asp Pro Asn His 
105 no 

- gag aaa tac tga 369 

> Glu Lys Tyr 
120 

<210> 76 ■ 

<2ll> 122 

<212> PRT 

<213> NM_000331 SAA1, serum amyloid Al 

<400> 76 

Met Lys Leu Leu Thr Gly Leu Val Phe Cys Ser Leu Val Leu Gly Val 
1 5 10 15 

Ser Ser Arg Ser Phe Phe Ser Phe Leu Gly Glu Ala Phe Asp Gly Ala 
20 25 30 

Arg Asp Met Trp Arg Ala Tyr Ser Asp Met Arg Glu Ala Asn Tyr lie 
35 40 45 

Gly Ser Asp Lys Tyr Phe His Ala Arg Gly Asn Tyr Asp Ala Ala Lys 
50 55 60 • 

Arg Gly Pro Gly Gly Val Trp Ala Ala Glu Ala He Ser Asp Ala Arg 
65 70 75 80 

Glu Asn lie Gin Arg Phe Phe Gly His Gly Ala Glu Asp Ser Leu Ala 
85 go 95 

Asp Gin Ala Ala Asn Glu Trp Gly Arg Ser Gly Lys Asp Pro Asn His 
100 105 no 



120 





115 


<210> 


77 


<211> 


895 


<212> 


DNA 


<213> 


NMJU4059 RGC32 



<220> 

<221> CDS 
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<222> (147).. (497) 
<223> 



<400> 77 

gcggccgcgt cgaccggcgc ggctggagcg cagcgccgaa gggactggca gggctgaagt 60 

gtgcgggaca gcaagccccc gaatagcccc ggctgccacc tcgcaggacc caaggccacg 120 

cgcgccgggc ccagctgagc cgcctc atg aag ccg ccc gcg gag gac ctg teg ■ 173 

Met Lys Pro Pro Ala Glu Asp Leu Ser 
1 5 

gac gcg ctg tgc gag ttt gac gcg gtg ctg gec gac ttc gcg teg ccc 221 
Asp Ala Leu Cys Glu Phe Asp Ala Val Leu Ala Asp Phe Ala Ser Pro 
10 15 20 25 

ttc cac gag cgc cac ttc cac tac gag gag- cac ctg gag cgc atg aag 269 
Phe His Glu Arg His Phe His Tyr Glu Glu His Leu Glu Arg Met Lys 
30 35 40 

egg cgc age age gec agt gtc age gac age age ggc ttc age gac teg 317 
Arg Arg Ser Ser Ala Ser Val Ser Asp Ser Ser iGly Phe Ser Asp Ser 
45 50 55 

gag agt gca gat tea ctt tat agg aac age ttc age ttc agt gat gaa 365 
Glu Ser Ala Asp Ser Leu Tyr Arg Asn Ser Phe Ser Phe Ser Asp Glu 
60 65 70 

aaa ctg aat tct cca aca gac tct ace cca get ctt etc tct gee act 
Lys Leu Asn Ser Pro Thr Asp Ser Thr Pro Ala Leu Leu Ser Ala Thr 
75 80 85 

gtc act cct cag aaa get aaa tta gga gac aca aaa gag eta gaa gee 
Val Thr Pro Gin Lys Ala Lys Leu Gly Asp Thr Lys Glu Leu Glu Ala 
90 95 100 105 

ttc att get gat ctt gac aaa act tta gca agt atg tgaaacaaga 
Phe lie Ala Asp Leu Asp Lys Thr Leu Ala Ser Met 
110 . H5 

agttctgggt cctttcatca taagggagaa gcttcagaaa gttccgagga ectgetaaaa 567 

tcagctacta gaatctgctg ccagagggga caaagaegtg cactcaacct tctaccaggc 627 

cactctcagg ctcaccttaa aatcagccct tgatcccatt tctgggcaat ttagacagtg 687 

aaactgactt tgtttacctg ettgeagcat attagaacag acgatccatg ctaatattgt 747 

attttctctt aaaacatagc tttcctgtaa tttaaagtgc ttttatgaaa atatttgtaa 807 

ttaattatat atagttggaa atagcagtaa gctttcccat tataatatat ttttgtatac 867 

aaataaaatt tgaactgaac ctcgtgcc 895 

<210> 78 

<211> 117 

<212> PRT 

<213> NM 014059 RGC32 



413 



461 



507 



; ) O 

WO 2004/022778 PCT/AU2003/001166 

206 

<400>. 78 

Met Lys Pro Pro Ala Glu Asp Leu Ser Asp Ala Leu Cys Glu Phe Asp 
1 5 io 15 

Ala Val Leu Ala Asp Phe Ala Ser Pro Phe His Glu Arg His Phe His 
20 25 30 

Tyr Glu Glu His Leu Glu Arg Met Lys Arg Arg Ser Ser Ala Ser Val 
35 40 45 

Ser Asp Ser Ser Gly Phe Ser Asp Ser Glu Ser Ala Asp Ser Leu Tyr 
50 55 60 

Arg Asn Ser Phe Ser Phe Ser Asp Glu Lys Leu Asn Ser Pro Thr Asp 
65 70 * 75 80 

Ser Thr Pro Ala Leu Leu Ser Ala Thr Val Thr Pro Gin Lys Ala Lys 
85 go 95 

Leu Gly Asp Thr Lys Glu Leu Glu Ala Phe He Ala Asp Leu Asp Lys 
!00 105 HO 

Thr Leu Ala Ser Met 
115 

<210> 79 

<211> 1564 

<212> DNA 

<213> m 018004 FLJ10134 



<220> 

<221> CDS 

<222> (314) (1138) 

<223> 



<400> 79 

gaagcacatc tggacagctg tgcggcctcc ttgcgggccg acgtcagccg agcacgtccc 60 

ccacgtcctc tccttctcgc cacttattat ttattcgttt tcccaaagaa gcgactaggg 120 

acccaagttt aaaaattcct ccccccactc aatgcgagac gtggccagat cccatccaac 180 

acacggttta attttcatgg ggctctggga tcaaaagaac agaaacagca acaacaaaag 240 

cccagccgct gtctgatttt aagctggcaa agtgggaaaa ataaagtgtt gagtaaacag 300 

accaagttgg ate atg ggg aat ttc aga ggt cat gec etc cct gga ace 349 
Met Gly Asn Phe Arg Gly His Ala Leu Pro Gly Thr 
1 5 .10 



o 
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Pne Pae ml Tit lit If t gg tg9 t9t aCa aag agt att <*g aag 
Phe Phe Phe He lie Gly Leu Trp Trp Cys Thr Lys Ser He Leu Lys 

15 .20 25 

tat ate tgc aaa aag caa aag cga acc tgc tat ctt ggt tec aaa aca 
Tyr lie Cys Lys Lys Gin Lys Arg Thr Cys Tyr Leu Gly Ser Lys 

35 An 



1« 150 



155 



t! 9 x 9 ^? 2*? ttt Ct9 aca 990 cto g " 9== tte eta gag tte ctt 
Val Leu Val Val Phe Leu Thr Gly Leu Val Ala Phe Leu Glu Pne Leu 
160 165 



170 



397 



445 



?™ pk° ^ x tg gaa 3tt ttg ^ag.gga att aca ata gtt ggc atg 4 93 

Leu Phe Tyr Arg Leu Glu He Leu Glu Gly He Thr lie Val Hy Met 
50 cc 



55 60 

2? H? C 3t9 9Ct 999 gag cag fc tt. att cct gga ggg ccc cat 541 
Ala Leu Thr Gly Met Ala Gly Glu Gin Phe lie Pro Ily lly Pro His 
65 70 75 

Itl ? tB ? 3C tat 333 Caa 99t Cac tgg aat caa cfc c ctg ggc 589 
Leu Met Leu Tyr Asp Tyr Lys Gin Gly His Trp Asn Gin Leu Leu G^y 

80 85 90 

5 2 2 S % S S S 2 £ g a a ~ - K « 

yi 100 105 

■S S £ SS £ S S 2E £ £ £ B £ £ S 5 685 

115 120 

£u Mel tin aat ?f C tta tfct 9t9 gag gcc fctt a t<= «c tac aac 733 
Leu Met Leu Ser Asn Ala Leu Phe Val Glu Ala Phe He Phe Tyr Asn 

130 135 J40 

ml 5£ o at 2?° °" 9aa atg ctg gac atc tfct 9tg cac cag ctg ctq 78 i 
Hrs Thr Has Gly Arg Glu Met Leu Asp lie Phe Val His Gin Leu Leu 



829 



877 



925 



gtt egg aac aat gta ctt ctg gag eta. ttg egg tea agt etc att ctg 
Val Arg Asn Asn Val Leu Leu Glu Leu Leu Arg Ser Ser Leu lit Leu 
1,b 180 185 

l2 G?n 111 t 99 ^ ttfc Cag att " a ttt Ste ctg tat ccc ccc 
Leu Gin Gly Ser Trp Phe Phe Gin He Gly Phe Val Leu Tyr Pro Pro 

195 200 

Ser III It III IT t 99 93t Ct9 atg gat cat gaa aa t att ttg ttt 973 
Ser Gly Gly Pro Ala Trp Asp Leu Met Asp His Glu Asn He Leu Phe 

210 . 215 220 

f tC %1 C ^ a tgc ttfc tgt.tgg cat tat gca gta acc att gtc atc ott 1091 
Leu Thr He Cys Phe Cys Trp His Tyr Ala Val Thr He Val He vtl 
225 230 235 

111 E? w T 3t ??* 2° att aCC tgg ttg gtt aaa tct aga ctt aag 1069 
Gly Met Asn Tyr Ala Phe He Thr Trp Leu Val Lys Ser Arg Leu LyI 

245 250 

III fl C , Z 30 o° C 938 gt;t gga ctt ctg aaa aat get gaa cga gaa 1117 
Arg Leu Cys Ser Ser Glu Val Gly Leu Leu Lys Asn Ala Glu Arg Glu 
255 260 265 

£ Glu se" S & £ £ gagCttCCag ""tctaga 

270 275 
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taaacctttt cttttttaca ttgttcttgg ttttgtttct cgatottttg tttggagaac 1228 

agctggctaa ggatgactct aagtgtactg tttgcatttc caatttggtt aaagtatttg 1288 

aatttaaata ttttcttttt agctttgaaa atattttggg tgatactttc attttgcaca 1348 

tcatgcacat catggtattc aggggctaga gtgatttttt tccagattat ctaaagttgg 1408 

atgoccacac tatgaaagaa atatttgttt tatttgcctt atagatatgc tcaaggttac 1468 

tgggcttgct actatttgta actccttgac catggaatta tacttgttta tcttgttgct 1528 

.gcaatgagaa ataaatgaat gtatgtattt tggtgc 1564 

<210> 80 
<211> 275 
<212> PRT 

<213> NM_018004 FLJ10134 
<400> 80 

Met Gly Asn Phe Arg Gly His Ala Leu Pro Gly Thr Phe Phe Phe He 
5 10 is 

He Gly Leu Trp Trp Cys Thr Lys Ser lie Leu Lys Tyr lie Cys Lys 
20 25 30 

Lys Gin Lys Arg Thr Cys Tyr Leu Gly Ser Lys Thr Leu Phe Tyr Arg 
J5 40 45 

Leu Glu He Leu Glu Gly lie Thr He Val Gly Met Ala Leu Thr Gly 
50 55 60 

Met Ala Gly Glu Gin Phe He Pro Gly Gly Pro His Leu Met Leu Tyr 
65 70 75 so 

Asp Tyr Lys Gin Gly His Trp Asn Gin Leu Leu Gly Trp His His Phe 
85 90 95 

Thr Met Tyr Phe Phe Phe Gly Leu Leu Gly Val Ala Asp He Leu Cys 
100 105 . no 

Phe Thr lie Ser Ser Leu Pro Val Ser Leu Thr Lys Leu Met Leu Ser 
115 120 125 

Asn Ala Leu Phe Val Glu Ala Phe He Phe Tyr Asn His Thr His Gly 
1J0 135 140 

Arg Glu Met Leu Asp He Phe Val His Gin Leu Leu Val Leu Val Val 
145 I 50 155 i 6 o 
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Phe Leu Thr Gly Leu Val Ala Phe Leu Glu Phe Leu Val Arg Asn Asn 
165 170 175 

Val Leu Leu Glu Leu Leu Arg Ser Ser Leu He Leu Leu Gin Gly Ser 
180 185 190 

Trp Phe Phe Gin lie Gly Phe Val Leu Tyr Pro Pro Ser Gly Gly Pro 
195 200 205 

Ala Trp Asp Leu Met Asp His Glu Asn He- Leu Phe Leu Thr He Cys 
210 215 220 

Phe Cys Trp His Tyr Ala Val Thr lie' Val lie Val Gly Met Asn Tyr 
225 230 ' 235 240 

Ala Phe He Thr Trp Leu Val Lys Ser Arg- Leu Lys Arg Leu Cys Ser 
245 250 255 

Ser Glu Val Gly Leu Leu Lys Asn Ala Glu Arg Glu Gin Glu Ser Glu 
2fi 0 265 270 

Glu Glu Met 
275 



<210> 81 

<211> 2311 

<212> DNA 

<213> NM_004004 GJB2, connexin 26 



<220> 

<221> CDS 

<222> (199).. (876) 

<223> 

<400> 81 

gatttaatcc tatgacaaac taagttggtt ctgtcttcac ctgttttggt gaggttgtgt 60 

aagagttggt gtttgctcag gaagagattt aagcatgctt gcttacccag actcagagaa 120 

gtctccctgt tctgtcctag ctatgttcct gtgttgtgtg cattcgtctt ttccagagca 180' 

aaccgcccag agtagaag atg gat tgg ggc acg ctg cag acg ate ctg ggg 231 
Met Asp Trp Gly Thr Leu Gin Thr He Leu Gly 
1 5 io 

ggt gtg aac aaa cac tec acc age att gga aag ate tgg etc acc gtc 279 
Gly Val Asn Lys His Ser Thr Ser He Gly Lys He Trp Leu Thr Val 
15 20 ■ 25 

etc ttc att ttt cgc att atg ate etc gtt gtg get gca aag gag gtg 327 
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Leu Phe lie Phe Arg He Met lie Leu Val Val Ala. Ala Lys Glu Val 
30 35 40 

III fit 9 f° 9a ° ttt 9t ° tgc aac acc ct 3 cag cca ggc 375 

Trp Gly Asp Glu Gin Ala Asp Phe Val Cys Asn Thr Leu Gin Pro Gly 

45 50 55 

rlt t 39 S* 9 t9C t3C gat Cac tac ttc ccc atc tec cac ate egg 42 3 
Cys Lys Asn Val Cys Tyr Asp His Tyr Phe Pro lie Ser His lie Arg 
60 65 . 70 ?5 y 

eta tgg gee ctg cag ctg atc ttc gtg tee age cca gcg etc eta gtg 471 
Leu Trp Ala . Leu Gin Leu He Phe Val .Ser Ser Pro Ala Leu Leu Val 
80 85 90 

lit ml Tit £? 9 ?° £ ac * gg aga cat gag aag aag agg aag ttc a t<= 519 
Ala Met His Val Ala Tyr Arg Arg His Glu Lys Lys Arg Lys Phe He 

95 100 !o 5 

Lvs III rfn f 39 c 9t 939 ttfc 389 gac atc gag W atc aaa acc 567 
Lys Gly Glu He Lys Ser Glu Phe Lys Asp He Glu Glu He Lys Thr 

110 115 120 

Gin TV 9 » g ° £° 933 " C tCC Ctg tgg tg 9 ac c tac aca age age 615 
Gin Lys Val Arg He Glu Gly Ser Leu Trp Trp Thr Tyr Thr Ser Ser 
125 130 135 

2° 099 9tC atC ttc gaa 9cc gee ttc atg tac gtc ttc tat 66? 
lie Phe Phe Arg Val lie Phe Glu Ala Ala Phe Me? Tyr Val Pne Tyr 
145 150 155 

™ m» 9 ^ ? a ° g ?° ttC tCC atg cag cgg cfc 9 5tg aao tgc aac gee 711 
Val Met Tyr Asp Gly Phe Ser Met Gin Arg Leu Val Lys Cys Asn 111 

160 165 170 

tgg cct tgt ccc aac act gtg gac tgc ttt gtg tec egg ccc aca aaa 7*0 
Trp Pro Cys Pro Asn Thr Val Asp Cys Phe Val Ser £g Pro Thr Ilu 
1,b 180 185 

= B K ^ a ~ 2 » "« s » 2 is s s: s 807 

190 195 200 

ctg ctg aat gtc act gaa ttg tgt tat ttg eta att aga tat tot tct ass 
Leu Leu Asn Val Thr Glu Leu Cys Tyr Leu Leu lie Arg iyr c£ Ser 

5 S2 sir Lys £s £ £ t^" 9 CCCagttg " aga " aagaa 
220 225 

atagaeagca tgagagggat gaggcaaccc gtgetcaget gtcaaggctc agtegecage 966 

atttcccaae acaaagattc tgaccttaaa tgcaaccatt tgaaacccct gtaggectea 1026 

ggtgaaactc cagatgccac aatgagctct gctcccctaa ageetcaaaa caaaggecta 1086 

attctatgec tgtcttaatt ttetttcact taagttagtt ccactgagac cccaggetgt 1146 

taggggttat tggtgtaagg tactttcata ttttaaacag aggatategg catttgtttc 1206 

tttctctgag gacaagagaa aaaagecagg ttccacagag gacacagaga aggtttgggt 1266 

gtcctcctgg ggttcttttt gccaactttc cccacgttaa aggtgaacat tggttctttc 1326 

atttgetttg gaagttttaa tctctaacag tggacaaagt taccagtgcc ttaaactctg 1386 

ttacactttt tggaagtgaa aactttgtag tatgataggt tattttgatg taaagatgtt 1446 
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ctggatacca 


ttatatgttc cccctgtttc 


agaggctcag attgtaatat gtaaatggta 


1506 


tgtcattcgc 


tactatgatt taatttgaaa 


tatggtcttt tggttatgaa tactttgcag 


1566 


cacagctgag 


agaggctgtc tgttgtattc 


attgtggtca tagcacctaa caacattgta 


1626 


gcctcaatcg 


agtgagacag actagaagtt 


cctagttggc ttatgatagc aaatggcctc 


1686 


atgtcaaata 


ttagatgtaa ttttgtgtaa 


gaaatacaga ctggatgtac caccaactac 


1746 


tacctgtaat 


gacaggcctg tccaacacat 


ctcccttttc catgctgtgg tagccagcat 


1806 


cggaaagaac 


gctgatttaa agaggtgagc 


ttgggaattt tattgacaca gtaccattta 


1866 


atggggagac 


aaaaatgggg gccaggggag 


ggagaagttt ctgtcgttaa aaacgagttt 


1926 


ggaaagactg 


gactctaaat tctgttgatt 


aaagatgagc tttgtctacc ttcaaaagtt 


1986 


tgtttggctt 


acccccttca gcctccaatt 


ttttaagtga aaatataact aataacatgt 


2046 


gaaaagaata 


gaagctaagg tttagata'aa 


tattgagcag atctatagga agattgaacc . 


2106 


tgaatattgc 


cattatgctt gacatggttt 


ccaaaaaatg gtactccaca tacttcagtg 


2166 


agggtaagta 


ttttcctgtt gtcaagaata 


gcattgtaaa agcattttgt aataataaag 


2226 


aatagcttta 


atgatatgct tgtaactaaa 


ataattttgt aatgtatcaa atacatttaa 


2286 


aacattaaaa 


tataatctct ataat 




2311 



<210> 82 

<211> 226 

<212> PRT 

<213> NM_004004 GJB2, connexin 26 

<400> 82 

Met Asp Trp Gly Thr Leu Gin Thr He Leu Gly Gly Val Asn Lys His 
1 5 10 15 

Ser Thr Ser lie Gly Lys He Trp Leu Thr Val Leu Phe He Phe Arg 
20 25 " 30 

He Met He Leu Val Val Ala Ala Lys Glu Val Trp Gly Asp Glu Gin 
35 40 45 

Ala Asp Phe Val Cys Asn Thr Leu Gin Pro Gly Cys Lys Asn Val Cys 
50 55 60 

Tyr Asp His Tyr Phe Pro He Ser His He Arg Leu Trp Ala Leu Gin 
65 70 75 80 



: ) 

PCT/AU2003/001166 

211 



Leu He Phe Val Ser Ser Pro Ala Leu Leu Val Ala Met His Val Ala 
85 90 95 
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Tyr Arg Arg His Glu Lys Lys Arg Lys Phe lie Lys Gly GXu He Lys 
100 105 HO 



Ser Glu Phe Lys Asp He Glu Glu He Lys Thr Gin Lys Val Arg He 
115 120 125 



Glu Gly Ser Leu Trp Trp Thr Tyr Thr Ser Ser He Phe Phe Arg Val 
130 135 140 



He Phe Glu Ala Ala Phe Met Tyr Val Phe Tyr Val Met Tyr Asp Gly 



150 



155 



160 



Phe Ser Met Gin Arg Leu Val Lys Cys Asn Ala Trp Pro Cys Pro Asn 
165 170 175 



Thr Val Asp Cys Phe Val Ser Arg Pro Thr Glu Lys Thr Val Phe Thr 
180 185 190 



Val Phe Met He Ala Val Ser Gly He Cys He Leu Leu Asn Val Thr 
195 200 205 



Glu Leu Cys Tyr Leu Leu He Arg Tyr Cys Ser Gly Lys Ser Lys Lys 
210 . 215 220 



Pro Val 
225 



<210> 83 

<211> 2389 

<212> DNA 

<213> NM 002514 



NOV1 , nephroblastoma 



overexpressed gene 



<220> 

<221> CDS 

<222> (73) . . (1143) 

<223> 



<400> 83 

gggaaggcga gcagtgccaa tctacagcga agaaagtctc gtttggtaaa agcgagaggg 60 

gaaagcctga gc atg cag agt gtg cag age acg age ttt tgt etc cga aag 111 
Met Gin Ser Val Gin Ser Thr Ser Phe Cys Leu Arg Lys 
1 5 io 

cag tgc ctt tgc ctg acc ttc ctg ctt etc cat etc ctg gga cag gtc 159 
Gin Cys Leu Cys Leu Thr Phe Leu Leu Leu His Leu Leu Gly Gin Val 
15 20 25 

get gcg act cag cgc tgc cct ccc cag tgc ccg ggc egg tgc cct gcg 207 
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Ala Ala Thr Gin Arg Cys Pro Pro Gin Cys Pro Gly Arg Cys Pro Ala 
30 35 * 40 45 

acg ccg ccg acc tgc gcc ccc ggg gtg cgc gcg gtg ctg gac ggc tgc 255 
Thr Pro Pro Thr Cys Ala Pro Gly Val Arg Ala Val Leu Asp Gly Cys 
50 55 . 6o 

tea tgc tgt ctg gtg tgt gcc cgc cag cgt ggc gag age tgc tea gat 303 
Ser Cys Cys Leu Val Cys Ala Arg Gin Arg Gly Glu Ser Cys Ser Asp 
65 70 75 

ctg gag cca tgc gac gag age agt ggc etc tac tgt gat cgc age gcg 351 
Leu Glu Pro Cys Asp Glu Ser Ser Gly Leu Tyr Cys Asp Arg Ser Ala 
80 85 90 

gac ccc age aac cag act ggc ate tgc acg gcg gta gag gga gat aac 399 
Asp Pro Ser Asn Gin Thr Gly He Cys Thr Ala Val Glu Gly Asp Asn 
95 100 105 

tgt gtg ttc gat ggg gtc ate tac cgc agt gga gag aaa ttt cag cca 447 
Cys Val Phe Asp Gly Val He Tyr Arg Ser Gly Glu Lys Phe Gin Pro 
110 115 120 125 

age tgc aaa ttc cag tgc acc tgc aga gat ggg cag att ggc tgt gtg 495 
Ser Cys Lys Phe Gin Cys Thr Cys Arg Asp Gly Gin He Gly Cys Val 
130 135 140 

ccc cgc tgt cag ctg gat gtg eta ctg cct gag- cct aac tgc cca get 543 
Pro Arg Cys Gin Leu Asp Val Leu Leu Pro Glu Pro Asn Cys Pro Ala 
145 150 155 

cca aga aaa gtt gag gtg cct gga gag tgc tgt gaa aag tgg ate tgt 591 
Pro Arg Lys Val Glu Val Pro Gly Glu Cys Cys Glu Lys Trp He Cys 
160 165 170 

ggc cca gat gag gag gat tea ctg gga ggc ctt acc ctt gca get tac 639 
Gly Pro Asp Glu Glu Asp Ser Leu Gly Gly Leu Thr Leu Ala Ala Tyr 
175 180 185 

agg cca gaa gcc acc eta gga gta gaa gtc tct gac tea agt gtc aac 687 
Arg Pro Glu Ala Thr Leu Gly Val Glu Val Ser Asp Ser Ser Val Asn 
190 195 200 205 

tgc att gaa cag acc aca gag tgg aca gca tgc tec aag age tgt ggt 735 
Cys He Glu Gin Thr Thr Glu Trp Thr Ala Cys Ser Lys Ser Cys Gly 
210 215 220 



atg ggg ttc tec acc egg gtc acc aat agg aac cgt caa tgt gag atg 
Met Gly Phe Ser Thr Arg Val Thr Asn Arg Asn Arg Gin Cys Glu Met 
225 230 235 



783 



ctg aaa cag act egg etc tgc atg gtg egg ccc tgt gaa caa gag cca 831 

Leu Lys Gin Thr Arg Leu Cys Met Val Arg Pro Cys Glu Gin Glu Pro 

240 245 250 

gag cag cca aca gat aag aaa gga aaa aag tgt etc cgc acc aag aag 879 

Glu Gin Pro Thr Asp Lys Lys Gly Lys Lys Cys Leu Arg Thr Lys Lys 

255 260 265 

tea etc aaa gcc ate cac ctg cag ttc aag aac tgc acc age ctg cac 927 

Ser Leu Lys Ala He His Leu Gin Phe Lys Asn Cys Thr Ser Leu His 

270 275 280 285 

acc tac aag ccc agg ttc tgt ggg gtc tgc agt gat ggc cgc tgc tgc 975 

Thr Tyr Lys Pro Arg Phe Cys Gly Val Cys Ser Asp Gly Arg Cys Cys 

290 295 300 

act ccc cac aat acc aaa acc ate cag gca gag ttt cag tgc tec cca 1023 
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Thr Pro His Asn Thr Lys Thr lie Gin Ala Glu Phe Gin Cys Ser Pro 
305 310 315 

ggg caa ata gtc aag aag cca gtg atg gtc att ggg acc tgc acc tgt 1071 
Gly Gin He Val Lys Lys Pro Val Met Val He Gly Thr Cys Thr Cys 
320 325 330 

cac acc aac tgt cct aag aac aat gag gcc ttc etc cag gag ctg gag 1119 
His Thr Asn Cys Pro Lys Asn Asn Glu Ala Phe Leu . Gin Glu Leu Glu 
335 340 345 

ctg aag act acc aga ggg aaa atg taacctatca ctcaagaagc . acacctacag 1173 
Leu Lys Thr Thr Arg Gly Lys Met 
350 355 



agcacctgta 


gctgctgcgc 


cacccaccat 


caaaggaata 


taagaaaagt 


aatgaagaat 


1233 


cacgatttca 


tccttgaatc 


ctatgtattt 


tcctaatgtg 


atcatatgag 


gacctttcat 


1293 


atctgtcttt 


tatttaacaa 


aaaatgtaat 


taactgtaaa 


cttggaatca 


aggtaagctc 


1353 


aggatatggc 


ttaggaatga 


cttactttcc 


tgtggtttta 


ttacaaatgc 


aaatttctat 


1413 


aaatttaaga 


aaacaagtat 


ataatttact 


ttgtagactg 


tttcacattg 


cactcatcat 


1473 


attttgttgt 


gcactagtgc 


aattccaaga 


aaatatcact 


gtaatgagtc 


agtgaagtct 


1533 


agaatcatac 


ttaacatttc 


attgtacaag 


tattacaacc 


atatattgag 


gttcattggg 


1593 


aagattctct 


attggctccc 


tttttgggta 


aaccagctct 


gaacttccaa 


gctccaaatc 


1653 


caaggaaaca- 


tgeagctett 


caacatgaca 


tccagagatg 


actattactt 


ttctgtttag 


1713 


ttttacacta 


ggaaacgtgt 


tgtatctaca 


gtaatgaaat 


gtttactaag 


tggactggtg 


1773 


tcataaactt 


tctccattta 


agacacattg 


actcctttcc 


aatagaaaga 


aactaaacag 


1833 


aaaactccca 


atacaaagat 


gactggtccc 


tcatagccct 


cagacattta 


tatattggaa 


1893 


getgetgagg 


cccccaagtt 


ttttaattaa 


gcagaaacag 


catattagca 


gggattctct 


1953 


catctaactg 


atgagtaaac 


tgaggcccaa 


ageacttget 


tacatcctct 


gatagctgtt 


2013 


tcaaatgtgc 


attttgtgga 


attttgagaa 


aaatagagca 


aaatcaacat 


gactggtggt 


2073 


gagagaccac 


acattttatg 


agagtttgga 


attattgtag 


acatgcccaa 


aacttatcct 


2133 


tgggccataa 


ttatgaaaac 


tcatgatcaa 


gatatatgtg 


tatacataca 


tgtatctggt 


2193 


ttgtcaggct 


acaaggtagg 


ctgeaaaatt 


aaatctagac 


attcttttaa 


tgccaccaca 


2253 


cgtgttccgc 


ttctctcttt 


taaagtattt 


ataaaaatat 


aaattgtaca 


ttttgtaaaa 


2313 


tattatgttt 


gatttctcta 


cttgtcatat 


cactaaataa 


acacgatttt 


attgetgaaa 


2373 


aaaaaaaaaa 


aaaaaa 
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<210> 84 

<211> 357 

<212> PRT 

<213> NM_002514 NOV1, nephroblastoma overexpressed gene 



WO 2004/022778 PCT/AU2003/001166 

215 



<400> 84 

Met Gin Ser Val Gin Ser Thr Ser Phe Cys Leu Arg Lys Gin Cys Leu 
1 5 10 15 



Cys Leu Thr Phe Leu Leu Leu His Leu Leu Gly Gin Val Ala Ala Thr 
20 25 30 



Gin Arg Cys Pro Pro Gin Cys Pro Gly Arg Cys Pro Ala Thr Pro Pro 
35 40 45 



Thr Cys Ala Pro Gly Val Arg Ala Val Leu Asp Gly Cys Ser Cys Cys 
50 55 60 



Leu Val Cys Ala Arg Gin Arg Gly Glu Ser Cys Ser Asp Leu Glu Pro 
65 70 75 80 



Cys Asp Glu Ser Ser Gly Leu Tyr Cys Asp Arg Ser Ala Asp Pro Ser 
85 90 95 



Asn Gin Thr Gly lie Cys Thr Ala Val Glu Gly Asp Asn Cys Val Phe 
100 105 110 



Asp Gly Val lie Tyr Arg Ser Gly Glu Lys Phe Gin Pro Ser Cys Lys 
115 120 125 



Phe Gin Cys Thr Cys Arg Asp Gly Gin He Gly Cys Val Pro Arg Cys 
130 135 140 



Gin Leu Asp Val Leu Leu Pro Glu Pro Asn Cys Pro Ala Pro Arg Lys 
145 150 155 160 



Val Glu Val Pro Gly Glu Cys Cys Glu Lys Trp lie Cys Gly Pro Asp 
165 170 175 



Glu Glu Asp Ser Leu Gly Gly Leu Thr Leu Ala Ala Tyr Arg Pro Glu 
180 185 190 



Ala Thr Leu Gly Val Glu Val Ser Asp Ser Ser Val Asn Cys He Glu 
195 200 205 



Gin Thr Thr Glu Trp Thr Ala Cys Ser Lys Ser Cys Gly Met Gly Phe 
210 215 220 



Ser Thr Arg Val Thr Asn Arg Asn Arg Gin Cys Glu Met Leu Lys Gin 
225 230 235 240 



Thr Arg Leu Cys Met Val Arg Pro Cys Glu Gin Glu Pro Glu Gin Pro 
245 250 255 



Thr Asp Lys Lys Gly Lys Lys Cys Leu Arg Thr Lys Lys Ser Leu Lys 
260 265 270 
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WO 2004/022778 PCT/AU2OO3/001166 

. 216 



Ala He His Leu Gin Phe Lys Asn Cys Thr Ser Leu His Thr Tyr Lys 
275 280 285 



Pro Arg Phe Cys Gly Val Cys Ser Asp Gly Arg Cys Cys Thr Pro His 
290 295 300 



Asn Thr Lys Thr He Gin Ala Glu Phe Gin Cys Ser Pro Gly Gin He 
305 , 310 315 320 



Val Lys. Lys Pro Val Met Val He Gly Thr Cys Thr Cys His Thr Asn 
325 330 335 



Cys Pro Lys Asn Asn Glu Ala Phe Leu Gin Glu Leu Glu Leu Lys Thr 
340 345 350 



Thr Arg Gly Lys Met 
355 
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This international search report has not been established in respect of certain claims under Article 17(2)(a) for-the following 
reasons: 

1. Q Claims Nos: 

because they relate to subject matter not required to be searched by mis Authority, namely: 



X Claims Nos: 70-73 



3. 



□ 



because they relate to parts of the international application that do not comply with the prescribed requirements to 
such an extent that no meaningful international search can be carried out, specifically: 

The claims are not limited to the technical features of the invention as disclosed in, and supported by, 
the specification. The claims relate to methods of assessing the promoter regions of specific genes with 
respect to hypermethylation and ovarian cancer. However the specification only discloses sequences 
derived from cDNA clones and does not disclose any promoter sequences. 

Claims Nos : 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 
6.4(a) 



Box II Observations where unity of invention is lacking (Continuation of item 3 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 
See supplementary sheet 



| | As all required additional search fees were timely paid by the applicant, this international search report covers all 
searchable claims 

[ | As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite 
payment of any additional fee. 

[X | As only some of the required additional search fees were timely paid by the applicant, this international search 
report covers only those claims for which fees were paid, specifically claims Nos.:2-5, 7, 9, 15, 19, 20, 22, 24, 30, 
45 and 47-50 in full and claims 1, 10-12, 16-18, 25-27, 31-43, 51-53 and 60-69 as they relate to table 3. 



4 C ^° requiied additional search fees were timely paid by the applicant Consequently, this international search report 

is restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 
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| [ The additional search fees were accompanied by the applicant's protest. 
|X I N° protest accompanied the payment of additional search fees. 
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Continuation of Box No: II 

The application claims more than one invention. Rule 13.1 of the PCT states the principle that an International 
Application should relate to only one invention or, if there is more than one invention, that the inclusion of those 

" inventions in one International Application is only permitted if all inventions are so linked as to form a single 
general inventive concept. 

Rule 13.2 of the PCT defines the method for determining whether the requirement of unity of invention is satisfied 
in respect of a group of inventions claimed in an International application. Unity of invention exists only when 
there is a technical relationship among the claimed inventions involving one or more of the same or corresponding 
"special technical features." Trie expression "special technical features" is defined in Rule 132 as meaning those 
technical features that define a contribution which each of the inventions, considered as a whole, makes over the 
prior art. The determination is made on the contents of the claims as interpreted in light of the description and 
drawings (if any). 

The claims and the description relate to methods for the diagnosis, prediction and treatment of ovarian cancer, using 
gene and peptide sequences whose expression is altered in ovarian cancer. In particular the claims define methods 
relating to the use of over 693 specific gene and/or peptide sequences. 

Although all of the claims share the common feature that they all relate to methods or apparatus that involve the use of 
specific gene or peptide sequences whose expression is altered in ovarian cancer,.fhis feature is known (see the 
documents listed below). As such this feature cannot be regarded as a special technical feature and cannot confer 
unity on the inventions relating to the use of specific gene or peptide sequences for the detection or treatment of 
ovarian cancer. 

Dl US 5 912 142 
D2 US 6 268 165 
D3 WO 01 21653 

D4 Bayani et al (2002) Cancer Research 62, 3466-76 

Furthermore, there is nothing in the specification to suggest that the genes can be further divided into groups, where 
sequences within a group share a common special technical feature. Although the specification discloses mat some of 
the genes can be classified into further, narrower groups, these further divisions also cannot be regarded as special 
technical features. These divisions correspond to groups of genes that are down-regulated or up-regulated in cancers 
and genes that are associated with specific types of ovarian cancers, such as epithelial or mucinous ovarian cancer. 
However these groups are known, as are methods of using genes specifically associated with these narrower groups for 
the treatment and detection of specific subsets of ovarian cancers. See in particular D4 

Although there is a lack of unity the ISA has searched, as a service to the applicant, five sets often sequences for five 
search fees. 



These sequences comprise SEQIDNOS 1- 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 
46, 48, 50, 52, 53, 55, 57, 59, 60- 63, 65, 67, 69, 71-75, 77, 79, 81, 83. 



FormPCTflSA/210 (second sheet)(July 1998) 



o 



1 



INTERNATIONAL SEARCH REPORT 
Information on patent family members 



International application No. 
PCT/AU03/01166 



This Annex lists the known "A M publication level patent family members relating to the patent documents cited in the 
above-mentioned international search report. The Australian Patent Office is in no way liable for these particulars 
which are merely given for the purpose of information. 



Patent Document Cited in 


Patent Family Member 


Search Report 





WO 0200677 



AU 


29508/01 


AU 


30958/01 


AU 


36459/01 


AU 


36460/01 


AU 


36461/01 


AU 


a** A ***** isf\ ■* 

36462/01 


AU 


36463/01 


AU 


36464/01 


AU 


36465/01 


AU 


36466701 


AU 


37943/01 


AU 


37944/01 


AU 


37947/01 


AU 


37949/01 


AU 


37950/01 


AU 


37951/01 


AU 


37952/01 


AU 


37953/01 


AU 


37954/01 


AU 


37955/01 


AU 


37957/01 


AU 


37958/01 


AU 


38585/01 


AU 


39726/01 


AU 


39727/01 


AU 


39728/01 


AU 


41402/01 


AU 


41403/01 


AU 


41404/01 


AU 


41405/01 


AU 


41406/01 , 


AU 


41407/01 


AU 


41408/01 


AU 


41409/01 


AU 


41410/01 


AU 


41411/01 


AU 


41412/01 


AU 


41413/01 


AU 


41414/01 


AU 


41415/01 


AU 


41416/01 


• AU 


41417/01 


AU 


41418/01 


AU 


41419/01 


AU 


43134/01 


AU 


43135/01 


AU 


43136/01 


AU 


43137/01 


AU 


45262/01 


AU 


45354/01 


AU 


47190/01 


AU 


47191/01 


AU 


49052/01 


AU 


49053/01 


AU 


49054/01 


AU 


50767/01 


AU 


50768/01 


AU 


50769/01 


AU 


50770/01 


AU 


50771/01 


AU 


50772/01 


AU 


52878/01 


AU 


52879/01 


AU 


55162/01 


AU. 


57912/01 


AU 


60969/01 


AU* 


62899/01 


AU 


66787/01 


CA 


2392398 


CA 


2392422 


CA 


2392428 


CA 


2392438 


CA 


2392450 


CA 


2392751 


CA 


2392757 


CA 


2393002 


CA 


2393616 


CA 


2393618 


CA 


2393652 


CA 


2393912 


CA 


2393941 


CA 


2393954 


CA 


2394022 


CA 


2394039 


CA 


2394841 


CA 


2395178 


CA 


2395295 


CA 


2395398 


CA 


2395403 


CA 


2395654 


CA 


2395666 


CA 


2395671 


QA 


2395676 


CA 


2395693 


CA 


2395699 


CA 


2395724 



Form PCT/ISA/2 10 (citation family annex) (July 1998) 



Information on patent family members 


International application No. 




PCT/AU03/01166 



CA 


2395729 


CA 


2395734 


CA 


2395738 


CA 


2395787 


CA 


2395794 


CA 


2395811 


CA 


2395815 


CA 


2395816 


CA 


2395827 


CA 


2395838 


CA 


2395849 * 


CA 


2395857 


CA 


2395858 


CA 


2395872 


CA 


2395885 


CA 


2395889 


CA 


2396719 


CA 


2397407 


CA 


2397502 


CA 


2397839 


CA 


2398227 


CA 


2398275 


CA 


2398411 


CA 


2398877 


EP 


1251863 


EP 


1252176 


EP 


1252185 


EP 


1252289 


EP 


1252290. 


EP 


1252297 


EP 


1252302 


EP 


1252303 


EP 


1252312 


EP 


1252326 


EP 


1252337 


EP 


1254147 


EP 


1254148 


EP 


1254150 


EP 


1254151 


EP 


1254152 


EP 


1254153 


EP 


1254154 


EP 


1254157 


EP 


1254165 


EP 


1254170 


EP 


1254171 


EP 


1254172 


EP 


1254173 


EP 


1254217 


EP 


1254218 


EP 


1254219 


EP 


1254228 


EP 


1254242 


EP 


1254248 


EP 


1254272 


EP 


1255766 


EP 


1255767 ■ 


EP 


1255768 


EP 


1255776 


EP 


1255777 


EP 


1255778 


EP 


1255817 


EP 


1255864 


EP 


1255869 


EP 


1259525 


EP 


1259526 


EP 


1259528 


EP 


1259531 


EP 


1259540 


EP 


1259642 


EP 


1261380 


EP 


1261618 


EP 


1261619 


EP 


1261633 


EP 


1261634 


EP 


1261635 


EP 


1261637 


EP 


1261703 


EP 


1261742 


EP 


1261745 


EP 


1263944 


EP 


1265910 


EP 


1268527 


US 


2002042096 


US 


2002042386 


us 


2002061521 


us 


2002077270 


US 


2002086330 


us 


2002086353 


us 


2002086811 


us 


2002086820 


us 


2002086821 


US 


2002086822 


us 


2002086823 


us 


2002090615 


US 


2002090672 


us. 


2002090673 


us 


2002090674 


US 


2002094953 


us 


2002102638 


us 


2002119919 


US 


2002120103 


us 


2002132753 


us 


2002132767 


US 


2002147140 


us 


2002151009 


us 


2002151479 


US 


2002160493 


us 


2002161208 


us 


2002164685 


US 


2002165137 



Form PCT/ISA/210 (citation family annex) (July 1998) 



INTERNATIONAL SEARCH REPORT 
formation on patent family members 


International application No. 




PCT/AU03/01166 



WO 02071928 



US 


2002 loo/ 11 


TTO 

us 


2002173454 


T TO 

US 


TAA'IAI 14ZAQ 

2003013649 


T TO 


2003039993 


TTO 

us 


2003039994 




OAA0.A/M BOA 


Ub 


OAAO A/1 A OA A 

2003044904 


TTO 


O Aftl f\A A OAC 

20030449U5 


TTO 


OA AO A/1/1 Q AT 


US 


1 A AO A A A C c a 

2003049650 


TTO 

US 


2003049652 


US 


A AAOA;IA7m 

2003049/03 


US 


2003050231 


TTO 

US 


OAA^AC/IO^O 

2003054368 


US 


1AA1AC/J7T1 

20030543 /3 


T TO 

US 


nAAOAf Anne 

2003054375 


TTO 

US 


IfVWrtf if 1 "7*7 

2003054377 


TTO 

US 


O AA O A C A 1 H A 

2003054379 


US 


2003054420 


TTO 

US 


lAA^Af A07C 

2003059875 


TTO 
US 


OAAOA<AAAO 

2003059908 


US 


2003077602 


T TO 

US 


2003077606 


T TO 

US 


*AA1 ATMA1 

2003077703 


US 


2003077704 


T TO 

US 


2003077808 


TTO 

US 


OAAO AOA^O 1 

2003082681 


US 


2003082758 


US 


2003092102 


T TO 

US 


ftftftftftftft^x*-* i 
2003092611 


us 


2003092615 


US 


2003096346 


US 


2003108907 


us 


2003125246 


us 


2003139327 


US 


2003157508 


us 


2003175739 


us 


2003190707 


wo 


ft < ^ J i 

0154472 


wo 


0154473 


wo 


0154474 


wo 


0154708 


wo 


0154733 


wo 


0155162 


wo 


0155163 


wo 


0155164 


wo 


0155167 


wo 


0155168 


wo 


0155173 


wo 


0155200 


wo 


0155201 


wo 


0155202 


wo 


0155203 


wo 


0155204 


wo 


0155205 


wo 


0155206 


wo 


ft -1 /- /- ft ft --7 

0155207 


wo 


0155208 


wo 


0155300 


wo 


AT f f >)A1 

0155301 


wo 


0155302 


wo 


0155303 


wo 


A 1 c c n t\A 

0155304 


wo 


/An i Ar 

0155305 


wo 


0155306 


wo 


a 1 c c n r\'~i 

0155307 


wo 


0155308 


wo 


0155309 


TT77"\ 

wo 


A 1 ^ ^ T T A 

0155310 


wo 


0155311 


wo 


At f f lit 

0155312 


wo 


at ccn 1 1 

0155313 


wo 


0155314 


wo 


0155315 


wo 


At ccn i 

0155316 


wo 


A1 c cn ■t »n 

0155317 


wo 


A1 c cn i o 

0155318 


T17V\ 

wo 


At ff) 1 A 

015531? 


wo 


/-V 1 ^ ^ r\ ft ft 

0155320 . 


wo 


0155321 


wo 


Air C 1 TO 

0155322 


wo 


0155323 


WQ 


A 1 C Cn T yl 

0155324 


wo 


AT c c 

0155325 


wo 


0155326 


wo 


AT *T Cn**"! 

0155327 


wo 


AT COOO 

0155328 


wo 


0155329 


wo 


0155343 


wo 


0155350 


wo 


0155355 


wo 


0155364 


wo 


0155367 


wo 


0155368 


wo 


0155387 


wo 


0155388 


wo 


0155430 


wo 


0155440 


wo 


0155441 


wo 


0155447 


wo 


0155448 


wo 


0155449 


wo 


0157182 


wo 


0159063 


wo 


0159064 


wo 


0162789 


wo 


0164703 






us 


2003087250 











Form PCT/ISA/210 (citation family annex) (July 1998) 



INTERNATIONAL SEARCH REPORT 
Information on patent family members 



International application No. 
| PCT/AU03/01166 



WO 


02102235 AU 


69022/00 


CA 


2381699 


EP 


1204764 




US 


2002015970 


US 


2002019330 


us 


2003124579 




US 


2003152926 


WO 


0111086 


WO 


02079492 




WO 


02086443 


WO 


02098358 


WO 


03003906 




WO 


03025138 


WO 


03042661 






US 
uo 


200300410? AU 


10703/99 


AU 


15324/99 


AU 


22122/99 




CA 


2305385 


C A 


2309358 


CA 


2321677 




PA 


2347835 


CA 


2353775 


EP 


1025227 




FP 




FP 


1135485 


EP 


1141284 




PP 


1 1d19R9 


FP 


1 1 5941 9 


EP 


1 1 59499 




EX 


1 1 / J JO J 


FP 
cr 


1 99090 S 


FP 


1263 94 R 




TKJ 


6A1070R 


UO 


9009079067 


US 

uo 


2009079092 

Z\JVJZU / LU7i 




TTC 
Uo 


9009079AQ6 


TTC 
UO 


9009079497 


US 
uo 


9002103195 

Z VUi i VJ 1 Z.J 




US 


9009 1 ioi 1R 


TTQ 
UO 


90091 19130 


US 
uo 


2002193463 




TTQ 
Uo 


9009 i?7S7fi 


uo 


9009197SR4 


US 
uo 


2002132252 




US 

Uu 


9009 13 99 ^3 

ZUUZ 1 J ZZ _) 3 


US 
uo 


9009149961 


US 
uo 


90091 S 6006 

Z WZ & J WW 




us 


90091 603 Rd 
ZUVJZ 1 UvJ OH 


T TQ 
UO 


9009161 199 


US 
uo 


90091 699 R4 

ZvvrLlU7^0*t 




TTQ 
UO 


7009177164 


US 
uo 


90091 771 6S 


us 

uo 


20021 771 RR 

£AJ\JAi kill OO 




TTQ 
UO 


9009177^ S3 


US 
uo 


90091R963R 


us 

uo 


20021 R2673 

Z»\7 V/Z» 1 OZ\J / J 




ns 

uo 


90091R3493 


US 
uo 


90021 R3494 


us 

uo 


20021 R3 505 




US 
uo 


9009 1 Q9900 


US 
uo 


2009192706 


us 

uo 


2002192752 




TTC 
UO 


9009 1 93999 
zuuz i yj£.yy 


US 
uo 


9009193300 


us 

uo 


2002197615 




US 
uo 


9009 1 Q7674 


US 
uo 


90021 981 4R 


us 

uo 


2002198149 




US 

uo 


2003003531 


us 

uo 


2003003546 


us 


2003004311 




IIS 

uo 


2003008297 


us 

uo 


2003008352 


us 


2003008353 




ITS 

uo 


2003009012 


us 

uo 


2003009013 


us 


2003013855 




ITS 

uo 


2003017476 


us 

uo 


2003017540 


us 


2003017541 




ITS 
uo 


9003017549 


US 

uo 


2003017543 


us 


2003017544 




US 
uo 


9003017563 


us 

uo 


2003017981 


us 


2003017982 




us 


2003018168 


us 


2003022187 


us • 


2003022239 




us 


2003022293 


us 


2003022294 


us 


2003022295 




us 


2003022296 


us 


2003022297 


us 


2003022298 




us 


2003022300 


us 


2003022301 


us 


2003022328 




us 


2003022331 


us 


2003027162 


us 


2003027163 




us 


2003027212 


us 


2003027262 


us 


2003027263 




us 


2003027264 


us 


2003027265 


us 


2003027266 




us 


2003027267 


us 


2003027269 


us 


2003027270 



Form PCT/ISA/210 (citation family annex) (July 1998) 



) 



j 



INTERNATIONAL SEARCH REPORT 
Infoimation on patent family members 



International application No. 
PCT/ATJ03/01166 



TTC 
Uo 


7007077971 


TTQ 
Uo 


9007077079 
ZUlOVJZ /Z /Z 


TTC 

Uo 


OAA0AO7O77 

Z00j0Z/Z/J 


TTC 


9007077974 


TTC 
Uo 


000000"777<C 
ZUUOUZ /Z/3 


TTC 
Uo 


OAAO AOOOO/J 

ZOOJOz/Z/O 


TTQ 

Uo 


7007097977 


TTC 
Uo 


7007077978 


TTC 
Uo 


OOO0AO7O70 

ZUlOUZ/ z/y 


TTC 
Uo 


9007097980 


TTC 
Uo 


90070979 R1 
ZUV.JUZ/ZoI 


TTC 
Uo 


0AA7A0770/1 

ZU030Z / d/A 


TTC 
Uo 


70070977^/1 


TTC 
Uo 


9007097QR^I 


TTC 
Uo 


0OA0A97QC£ 


TTC 
Uo 


7001097009 
ZwJvZ/yj'Z 


TTC 
Uo 


7007097007 


TTC 
Uo 


OAA7A7OAO0 

ZOOjOjZUZj 


TTC 
Uo 


70070790^7 


TTC 
Uo 


ZUIA3UJZUOZ 


TTC 
Uo 


OAAO AOO A/CO 

2003032063 


TTC 

Uo 


70070*391 01 
ZUUjVjZlUl 


TTC 
Uo 


OAAOAOO 1 AO 
ZUUo03ZlUZ 


TTC 

US 


OAAO AOO 1 AO 

2U03032103 


Uo 


7007A771 O/l 
ZUUjUjZIIW 


TTC 
US 


OAA7A0 01 A< 

20U303211D 


TTC 

us 


OAAO AO0 1 A/C 

2003032106 


Uo 


OAAOAOO 1 AO 

zOOiOJzlO I 


TTO 
US 


OAAOAOO 1 AO 

2003032108 


TTO 

US 


OAAOAOO t AA 

2003032109 


TTO 

US 


2003032I10 


TTO 

US 


2003032111 


T TO 

us 


OAAOAOOI 1 O 

20030321 12 


TTO 

US 


*%AA1 /\0 O 1 1 1 

2003032113 


T TO 

us 


2003032114 


T TO 

us 


OAAOAOOI 1 C 

2003032115 


TTO 

US 


20030321 16 


T TO 

US 


OAAOAOOI 1*7 

2003032117 


T TO 

us 


OAAO AO 0 1 -4 ft 

2003032H8 


T TO 

Uo 


2003032119 


T TO 

US 


2003032120 


us 


AAA') A^tllf 1 

2003032121 


TTO 

Uo 


'"iAA'JA'iOl '1'** 

2003032122 


TTO 

US 


OA AO AO O 1 OO 

2003032123 


us 


2003032124 


TTO 
US 


OAA'iAI'tl OC 

2003032125 


T TO 

US 


OA AO AOO 1 OZT 

2003032126 


TTO 

us 


OAAOAOO 1 OO 

2003032127 


TTO 

US 


OAAOAOO lOO 

2003032128 


T TO 

US 


OAAO AO O 1 OA 

2003032129 


T TO 

us 


OAAO AOO ItA 

2003032130 


T TO 
US 


OAAOAIOl O 1 

2003032131 


T TO 

US 


OA AO AO Ol OO 

2003032132 


TTO 

US 


OAAO AO0 1 O O 

2003032133 


TTO 
US 


2003032134 


TTO 

US 


O A AO. AO 0 1 O C 

2003032135 


TTO 

US 


OAAOAOO 10/ 

2003032136 


TTO 

US 


2003032137 


TTO 

US 


OAAO A0O1 OO 

2003032138 


TTO 

US 


OAAOAOO 1 OA 

2003032139 


TTO 

US 


'lAAIAI'il Af\ 

2003032140 


TTO 

US 


OAAOAOOI CC 

2003032155 


TTO 

US 


OAA1 AOOI CZ* 

2003032156 


Uo 


2003034993 


TTO 
US 


OAAO AO /C1 1 O 

200303 ol 17 


TTO 
US 


OAAOAOO"! 1 O 

2003036118 


T TO 

US 


2003036 120 


TTO 

US 


OAAO AO/CIOI 

2003036121 


TTO 
US 


OAAO AO C 1 OO 

2003036122 


TTC 

Uo 


2003036123 


TTO 

US 


OAAO AO /CI O/l 

2003036124 


FTO 

US 


OAAO AO^T OC 

2003036125 


TTC 

Uo 


OAAOAOiCI O/C 

2003036126 


TTO 

US 


OAAO AO /C1 OO 

200303612/ 


TTO 

US 


OAAOAO/T1 OO 

2003036128 


TTC 
US 


2003036 I 29 


TTO 

US 


OAA0A0/C1 OA 

2003036130 


TTO 

US 


OAAO AOiTI O 1 

2003036131 


TTC 

US 


O AA"5 A1 / HO 

2003036132 


TTO 
US 


OAAOAOOI OO 

2003036133 


T TO 

US 


OAAOAOOI O A 

2003036134 


TTO 

US 


2003036135 


TTO 
US 


OAAO AO C\ 

2003036136 


TTO 

US 


OAAOAOOI O "7 

2003036137 


T TO 
US 


2003036138 


TIO 

US 


OAAO AO CI OA 

2003036139 


TTO 

US 


OAAOAOOI A A 

2003036140 


US 


2003036141 


US 


2003036142 


us 


2003036143 


US 


2003036144 


US 


2003036145 


us 


2003036146 


us 


2003036147 


us 


2003036148 


us 


2003036149 


us 


2003036150 


us 


2003036151 


us 


2003036152 


us 


2003036153 


us 


2003036154 


us 


2003036155 


us 


2003036156 


us 


2003036157 


us 


2003036158 


us 


2003036159 


us 


2003036160 


us 


2003036161 


us 


2003036162 


us 


2003036164 


us 


2003036165 



Form PCT/IS A/2 ! 0 (citation family annex) (July 1 998) 



INTERNATIONAL SEARCH REPORT 
Information on patent family members 



US 


2003036179 


us 


2003036180 


us 


2003036634 


us 


2003038826 


us 


2003038827 


us 


2003038828 


us 


2003039648 


us 


2003040013 


us 


2003040014 


us 


2003040054 


us 


2003040055 


us 


2003040056 


us 


2003040057 


. us 


2003040058 


US 


2003040059 


us 


2003040060 


us 


2003040061 


US 


2003040062 


us 


2003040063 


us 


2003040064 


US 


2003040065 


us 


2003040066 


us 


2003040067 


US 


2003040068 


us 


2003040069 


us 


2003040070 


US 


2003040071 


us 


2003040072 


us 


2003040073 


US 


2003040074 


us 


2003040075 


us 


2003040076 


US 


2003040077 


us 


2003040078 


us 


2003040473 


us 


2003043176 


us 


. 2003044806 


us 


2003044841 


us 


2003044844 


us 


2003044902 


us 


2003044916 


us 


2003044917 


us 


2003044918 


us 


2003044919 


us 


2003044920 


us 


2003044921 


us 


2003044922 


us 


2003044923 


us 


2003044924 


us 


2003044925 


us 


2003044926 


us 


2003044927 


us 


2003044928 


us 


2003044929 


us 


2003044930 


us 


2003044931 


us 


2003044932 


us 


2003044945 


us 


2003045462 


us 


2003045463 


us 


2003045684 


us 


2003049633 


us 


2003049638 


us 


2003049681 


us 


2003049682 


us 


2003049684 


us 


2003049735 


us 


2003049738 


us 


2003049741 


us 


2003049756 


us 


2003049816 


us 


2003049817 


us 


2003050239 


us 


2003050240 


us 


2003050241 


us 


2003050457 


us 


2003050462 


us 


2003054359 


us 


2003054403 


us 


2003054404 


us 


2003054405 


us 


2003054406 


us 


2003054442 


us 


2003054454 


us 


2003054455 


us 


2003054456 


us 


2003054458 


us 


2003054459 


us 


2003054460 


us 


2003054468 


us 


2003054516 


us 


2003054517 


us 


2003054986 


us 


2003054987 


us 


2003055216 


us 


2003055222 


us 


2003059780 


us 


2003059782 


us 


2003059783 


us 


2003059831 


us 


2003059832 


us 


2003059833 


us 


2003059876 


us 


2003059909 


us 


2003060406 


us 


2003060407 


us 


2003060600 


us 


2003060601 


us 


2003060602 


us 


2003064062 


us 


• 2003064375 



) 

International application No. 
PCT/AU03/01166 



Form PCT/1SA/2 10 (citation family annex) (July 1998) 



0 


.. j 


INTERNATIONAL SEARCH REPORT 
Information on patent family members 




International application No. 
PCT/AU03/0U66 



TTO 


*> A AO C\CA A IY7 

2003Uo44v// 


TTO 

Uo 




TTO 

Uo 




Uo 


OA AO f\CA OO ^ 


TTO 

Uo 




TTO 

Uo 


O AAO A#>< 1 /17 


TTO 


2003UOO/93 


TTO 

Uo 


1 A A 1 A*£ Q 7 0/l 

zUU3UOo fyH 


TTO 

Uo 


OAA3A<C7Q^ 
ZUUjUOo/sO 


Uo 


OAA0A£Q7O£ 


TTO 

Uo 


OAAOA#Cfi707 


TTO 
Uo 


7AAOA/C97G8 


TTO 

Uo 


OAAO A/C0 1 *70 

20030691 /8 


TTO 

Uo 


OAA"2A/C01 70 

2003Uoyi fy 


TTO 
Uo 


O AA Q A AGO QA 


T TO 

US 


O A AO AJTA /I AO 

2003069403 


T TO 

Uo 


OAAOA*71 

20030 /lo3:> 


TTO 

Uo 


0AAQA707/K 


T TO 

US 


2003073090 


T TO 

Uo 


20U3073 1Z9 


T TO 

Uo 


OAA1AT1 1 1f\ 

ZUU30/313U 


T TO 

US 


2003073131 


T TO 

us 


OAAOAIO 1 CVi 

2003073169 


TTO 

Uo 


OAAO AT -5 1 *7A 

2003073 I/O 


us 


2003073171 


f TO 

us 


/inrto A*70 1 TO 

2003073172 


TTO 
US 


AAAOAT1 1 "70 

2003073173 


us 


2003073174 


us 


2003073177 


T TO 

US 


OAA1AT>AAO 

2003073208 


us 


2003073210 


us 


lAAIAffll 1 

2003073211 


T TO 

US 


2003073212 


us 


2003073213 


us 


2003073214 


T TO 

US 


AAA1ATJ01 C 

2003073215 


us 


2003073216 


us 


2003073624 


us 


OAAO AT) OAA 

2003073809 


us 


2003073810 


us 


2003073813 


US • 


2003073821 


us 


2003077593 


us 


2003077594 


T TO 

us 


2003077659 


us 


2003077700 


us 


2003077718 


us 


2003077732 


us 


2003077733 


us 


AAA"! AOTI'7/' 

2003077776 


T TO 

us 


OA AO A11T71 

2003077777 


us 


2003077778 


us 


^AA1ATTT7A 

2003077779 


TTO 

us 


OA AO A"7T70 A 

2003077780 


us 


2003077781 


us 


2003077782 


TTO 

US 


OA AO ATTfOI 

2003077783 


us 


2003077784 


us 


2003077785 


T TO 

US 


oa ao Anno /T 

2003077786 


us 


2003077787 


us 


2003077788 


T TO 

US 


O AAO ATIIOA 

2003077789 


TTO 

us 


2003077790 


T TO 

us 


AAA1A7mA1 

2003077791 


TTO 
US 


2003077792 


us 


2003078377 


T TO 

us 


OAAH A OO 1 AA 

2003082199 


TTO 

US 


O A AO A OO C A C 

2003082M6 


us 


2003082626 


TTO 

us 


2003082627 


TTO 
US 


O AAO AOO^O O 

200308ZoZ5 


TTO 

us 


2003082689 


TTO 

us 


OAAOAOOOI C 

2003082715 


TTO 

Uo 


2003082 / 16 


T TO 

us 


2003082717 


T TO 

us 


2003082759 


TTO 

Uo 


OA AO AOOO^CA 

2UU3 082760 


us 


2003082761 


TTO 

us 


2003082762 


TTO 

Uo 


OA AO AOOO/TO 

2003082763 


TTCt 

us 


2003082764 


TTO 

us 


2003082765 


TTO 

Uo 


OA AO AOOT^iT 

2003082766 


us 


2003082767 


T TO 

us 


OAAOAOOOvIO 

2003083248 


TTO 

US 


OA AO AOO A C 1 

2003083461 


us 


2003083462 


us 


2003087304 


US 


2003087305 


us 


2003087344 


us 


2003087345 


US 


2003087365 


us 


2003087367 


us 


2003087373 


us 


2003087374 


us 


2003092061 


us 


2003092104 


us 


2003092147 


us 


2003092883 


us 


2003096353 


us 


2003096386 


us 


2003096744 


us 


2003096954 


us 


2003096955 


us 


2003099625 


us 


2003100061 


us 


2003100087 


us 


2003104536 


us 


2003104538 


us 


2003104539 



Form PCT/ISA/21 0 (citation family annex) (July 1998) 



o 

INTERNATIONAL SEARCH REPORT 
Information on patent family members 



US 


2003104540 


US 


2003104541 


US 


2003104542 


us 


2003104543 


US 


2003104998 


US 


2003109672 


us 


2003113793 


US 


2003113795 


us 


2003114652 


us 


2003119001 


US 


2003119055 


us 


2003119097 


us 


2003119105 


us 


2003119106 


us 


2003120053 


us 


2003124531 


. us 


2003124661 


us 


2003125535 


us 


2003129650 


us 


2003129695 


us 


2003130181 


us 


2003130182 


us 


2003130483 


us 


2003130490 


us 


2003130491 


us 


2003134284 


us 


2003134785 


us 


2003135034 


us 


2003138882 


us 


2003138898 


us 


2003139328 


us 


2003139329 


us 


2003148373 


us 


2003148375 


us 


2003148376 


us 


2003153000 


us 


2003157635 


us 


2003166104 


.us 


2003166105 


us 


2003166106 


us 


2003166107 


us 


2003166108 


us 


2003166109 


us 


2003166110 


us 


2003166111 


us 


2003166112 


us 


2003166113 


us 


2003166114 


us 


2003166115 


us 


2003166117 


us 


2003166118 


us 


2003166119 


us 


2003166120 


us 


2003166121 


us 


2003166122 


us 


2003166123 


us 


2003166124 


us 


2003166125 


us 


2003166126 


us 


2003166127 


us 


2003166128 


us 


2003166129 


us 


2003166130 


us 


2003166131 


us 


2003166133 


us 


2003166134 


us 


2003166135 


us 


2003166188 


us 


2003166848 


us 


2003170718 


us 


2003170721 


us 


2003175882 


us 


2003180311 


us 


2003180792 


us 


2003180793 


us 


2003180794 


us 


2003180795 


us 


2003180796 


us 


2003180836 


us 


2003180837 


us 


2003180838 


us 


2003180839 


us 


2003180840 


us 


2003180841 


us 


2003180842 


us 


2003180843 


us 


2003180844 


us 


2003180845 


us 


2003180846 


us 


2003180847 


us 


2003180848 


us 


2003180849 


us 


2003180850 


us 


2003180851 


us 


2003180852 


us 


2003180853 


us 


2003180854 


us 


2003180855 


us 


2003180856 


us 


2003180857 


us 


2003180858 


us 


2003180859 


us 


2003180860 


us 


2003180861 


us 


2003180862 


us 


2003180863 


us 


2003180867 


us 


2003180904 


us 


2003180905 


us 


2003180906 


us 


2003180907 



0 

International application No. 
PCT/AU03/01166 



Form PCT/ISA/210 (citation family annex) (July 1998) 



INTERNATIONAL SEARCH REPORT 
Information on patent family members 



International application No. 
PCT/AU03/01166 



TTQ 


ZUUJ 1 oi/yuo 


TTC 
UO 


onni 1 raqao 
zuu j i oxjyyjy 


TTC 

Uo 


9AA3 1 OAG1 A 

zuuJiouy iu 


T TO 
Uo 


onn^ 1 52001 1 
zuuj i ovy i i 


TTC 

uo 


OAAll RAQIO 


TTC 

Uo 


oaai i sag 1 i 

ZUUJ 1 cAJy i J 


TTO 
UO 


ZUUJ 1 OXJzr l*t 


TTC 
UO 


9 AO** IRAQIS 


TTC 
Uo 


9 RA0 1 £ 
ZUUJ 1 oU j 1 0 


TTC 


9fVm RAQ17 


TTC 
UO 


OAA*51 CAQ1 O 


TTC 
UO 


9AA"*1 PAQI 0 
ZUUJloU^ ly 


TTC 
uo 


zuuj i ouyzu 


TTC 
UO 


90A^ 1 8 AQ9 1 

ZUU j 1 OUSrZ i 


TTC 
UO 


9 AA7 1 SA099 

zuujiouyzz 


TTC 
Uo 


zuuj i ouyzj 


TTC 
UO 


OAA0 1 OliCO/: 
ZUUJ lOlOJO 


TTC 
Uo 


9AA11 01 6.11 
ZUUJ IB 10 J/ 


nc 

Uo 


9AAQ1 ci«s 

ZUUJioiOJo 


TTC 
Uo 


9AA1 1 81 AlO 

ZUUJ loiojy 


TTC 

Uo 


OAAOI 01 AA A 

ZUUJ 18 164U 


TTO 

Uo 


9AAH ci ^ i 

ZUUJ 101041 


TTC 
Uo 


9A A** 1 fi 1 6A0 

zuuj lis io4Z 


TTC 

Uo 


OAA11 Q1 tZAI 
ZUUJ lo 104 J 


Uo 


ZUUJ181o44 


TTC 

Uo 


OAAO 1 01 C 

ZUUJlolo45 


TTC 

Uo 


oaao 1 oi a a a 
2003181646 


¥ TO 

Uo 


OAA01 01 UAH 

2003181647 


TTC 

Uo 


OAAO 1 Ol iC/l O 

2003181648 


TTC 

us 


OAAO 1 Ol £ A A 

2003181649 


TIC 

Uo 


ZUU3181ojO 


TTC 

Uo 


OAAO 1 Ol CC 1 

2003181651 


TTC 

us 


OAAO 1 Ol £ C\ 

2003181652 


TTO 

Uo 


OAAO 1 O 1 

2003 181 653 


TTO 

us 


OAAO IOI £.C A 

2003181654 


TTO 

US 


2003181655 


TTC 

Uo 


OAAO 1 Ol £C/T 

2003181656 


T TO 

us 


OAAO 1 Ol /f7 

2003181657 


US 


2003181666 


T TO 

Uo 


2003181667 


T TO 

us 


i 

OAAO 1 Ol Zf^O 

2003181668 


us 


2003181669 


TTC 

Uo 


OA AO 1 Ol £Ort 

2003181670 


TTC 

US 


OAAO 1 Ol £71 

2003181671 


TTO 

us 


OAAO t O-f /T^ 

2003181672 


TTC 

Uo 


OAAO 101 /ZT7 

2003181673 


TTC 

Uo 


OAAO 1 0 1 /4 

2003181674 


TTO 

US 


OAAO t Ol £*1£ 

2003181675 


T TO 

Uo 


OAAO 1 Ol 

2003181676 


T TO 

US 


2003181677 


TTO 

US 


2003181678 


TTC 

Uo 


OAAO 101 i£OA 

20031 81 679 


TTO 

Uo 


OAAO 1 01 /'OA 

2003181680 


TTO 

US 


AAAO % 0 1 /*r> 1 

2003181681 


T TC 

Uo 


OAA3 101 ifOO 

2003181682 


TTO 

US 


OAAO 1 O 1 £00 

2003181683 


TTO 

US 


OAAO 101 fO A 

2003181684 


TTC 
UO 


OAA1 101 /CO^ 

2003181685 


TTC 

Uo 


OAAO 1 01 £LC\C 

2003181696 


TTO 

US 


OAAO 1 ot /ni 

2003181697 


TTC 

Uo 


OAAO IOI iTflO 

20031816V8 


TTC 

Uo 


OAAO 1 Ol £C\C\ 

2003181699 


TTO 

US 


OAAO 1 OT TAA 

2003181700 


TTC 

Uo 


OA AO 101 OA 1 

2003181 /Ol 


TTC 

Uo 


OAAO 1 O 1 OAO 

2003181702 


TTO 

US 


OAAO 1 O 1 OAO 

2003181703 


TTC 

Uo 


OAAO 1 OCC/t 1 

ZUUJ 185 841 


TTC 
US 


OAAO 1 O^O 1 O 

2003186318 


TTO 

US 


OAAO 1 o 1 rt 

2003186319 


TTC 

Uo 


9AA7 1 Q£.1£Z 

ZUUJIoOJOj 


TTC 


OAAO 1 O/TO/TO 

2003 1 86368 


TTO 

US 


OAAO 1 0<T>f AT 

2003186407 


Uo 


ZUUJ 1 oo**Uo 


TTC 

Uo 


OAAO 1 571 OO 

Z003 18/189 


TTC 

Uo 


OAAO 1 OO* 1 A A 

2003187190 


TTC 
Uo 


9AA71 CT1 Ol 

zuujio/i9i 


TTC 

Uo 


OAAO 1 Ol 1 OO 

200318/192 


TTC 

Uo 


OAAO 1 OH 1 AO 

2003187193 


TTC 

Uo 


onnn C7i o< 
ZUUJlo/195 


TTC 

Uo 


OAAO 1 OOl C\C 

2003187196 


TTO 

US 


OAAO 1 Oil AO 

2003187197 


TTC 

Uo 


OAAO 1 COOOO 

2003 1 6 /ZZo 


TTC 

Uo 


OAAO 1 OOOOA 

20U3 187229 


TTC 
US 


OAAO 1 OOOOA 

2003187230 


T IC 

Uo 


OAAO 1 OOOOQ 

ZU03 18/239 


TTC 

Uo 


OAAO 1 Oil A 1 

2003187241 


TTO 

US 


OAAO 'I OOO A O 

2003187242 


us 


2003190321 


US 


2003190667 


US 


2003190668 


us 


2003190669 


US 


2003190698 


us 


2003190699 


us 


2003190701 


US 


2003190703 


us 


2003190716 


us 


2003190717 


US 


2003190718 


us 


2003190719 


us 


2003190720 


US 


2003190721 


us 


2003190722 


us 


2003190723. 


US 


2003190724 


us 


2003190725 


us 


2003190726 


US 


2003190727 


us 


2003190728 


us 


2003190729 


US 


2003190730 


us 


2003190731 



Form PCT/ISA/210 (citation family annex) (July 1998) 



O 0 



INTERNATIONAL SEARCH REPORT 
Information on patent family members 


International application No. 




PCT/AU03/01166 







us 

\J L7 


' 2003191281 


US 


2003191282 


US 


2003191283 






US 


2003191284 

&VVJ M.Jr 1a(tT 


US 


2003191287 


us 


2003191282 






US 


2003191289 


US 


2003191290 


US 

wo 


2003191299 






us 


2003194744 


us 


2003194780 


us 


7003194781 






us 


2003194791 


us 


2003194792 


us 


2003194793 






us 


2003194794 


us 


2003195148 


us 


2003195333 






us 


2003195334 


us 


2003195344 


us 


2003195345 






us 


2003195347 


wo 


0032221 


wo 


0037638 






wo 


0037640 


wo 


0053752 


wo 


0053753 






wo 


0053755 


wo 


0053756 


wo 


0053757 






wo 


0053758 










WO 


' 0170976 


AU 


49317/01 


us 


2002048759 


us 


2003059781 


WO 


03068054 














US 


2002192678 














WO 


0197SR1 


ATT 


6^296/01 






X*Jt 








US 


9007117717 










WO 


AK1 C1Q 
VlJlJ U 


ATT 


1^470/01 










wu 


f\1Q0304 


ATT 


zzoi/o/ui 


ATT 
AU 


TOO* 8/01 


ATT 
AU 








ATT 
AU 


1£4£0/01 


ATT 
AU 


q/M<l /ni 


ATT 
AU 


3/^4^7/01 






ATT 
AU 


7£4/TC/01 
JO*fOj(Ul 


ATT 
AU 


1A464/01 


ATT 

AU 


1£4fi*/01 






ATT 
AU 


1£4fi£/01 
JjO^tOO/Ul 


ATT 
AU 


*5 704*3/01 


ATT 

AU 


17044/01 






ATT 
AU 


Q7Q47/01 


ATT 
AU 


17Q40/01 


ATT 

AU 


*i7Qcn/m 
0 /yj\j/vi 






ATT 
AU 




ATT 
AU 


170*7/01 


ATT 

AU 


370^3 /01 






ATT 
AU 


170*4/01 


ATT 
AU 


•370<</ni 

j /lOj/Ul 


ATT 

AU 


T7QV7/01 

j /20 i/Vx 






ATT 
AU 


q70co/m 
j /yoo/vi 


ATT 
AU 


IR^R^/OI 


ATT 

AU 


3Q776/01 






ATT 

AU 


-20777/01 


ATT 
AU 


1Q778/01 


ATT 
AU 


41409/01 






ATT 
AU 


41403/01 


ATT 

AU 


41404/01 


ATT 
AU 


41405V01 






ATT 
AU 


4140^/01 


ATT 
AU 


41407/01 


ATT 
AU 


4140R/01 






ATT 

AU 


41/100/01 


ATT 
AU 


41410/01 


ATT 

AU 


41411/01 






AU 


41412/01 


AU 


41413/01 


AU 


41414/01 






AU 


41415/01 


AU 


41416/01 


AU 


41417/01 






AU 


41418/01 


AU 


41419/01 


AU 


43134/01 






AU 


43135/01 


AU 


43136/01 


AU 


43137/01 






AU 


45262/01 


AU 


45354/01 


AU 


47190/01 






AU 


47191/01 


AU 


49052/01 


AU 


49053/01 






AU 


49054/01 


AU 


50767/01 


AU 


50768/01 






AU 


50769/01 


AU 


50770/01 


AU 


50771/01 



Form PCT/ISA/210 (citation femily annex) (July 1998) 



•:. } 

INTERNATIONAL SEARCH REPORT 
Information on patent family members 






) 






International application No. 
PCT/AU03/01166 


AU 


50772/01 


AU 


52878/01 


AU 


52879/01 


AU 


55162/01 


AU 


57912/01 


AU 


60969/01 


AU 


62899/01 


AU 


74888/01 


CA 


2392398 


CA 


2392422 


CA 


2392428 


CA 


2392438 


CA 


2392450 


CA 


2392751 


CA 


2392757 


CA 


2393002 


CA 


2393616 


CA 


2393618 


CA 


2393652 


CA 


2393912 


CA 


2393941 


CA 


2393954 


CA 


2394022 


CA 


2394039 


CA 


2394841 


CA 


2395178 


CA 


2395295 


CA 


2395398 


CA 


2395403 


CA 


2395654 


CA 


2395666 


CA 


2395671 


CA 


2395676 


CA 


2395693 


CA 


2395699 


CA 


2395724 


CA 


2395729 


CA 


2395734 


CA 


2395738 


CA 


2395787 


CA 


2395794 


CA 


2395811 


CA 


' 2395815 


CA 


2395816 


CA 


2395827 


CA 


2395838 


CA 


2395849 


CA 


2395857 


CA 


2395858 


CA 


2395872 


CA 


2395885 


CA 


2395889 


CA 


2396719 


CA 


2397407 


CA 


2397502 


CA 


2397839 


CA 


2398227 


CA 


2398275 


CA 


2398411 


CA 


2398877 


EP 


1251863 


EP 


1252176 


EP 


1252185 


EP 


,1252289 


EP 


1252290 


EP 


1252297 


EP 


1252302 


EP 


1252303 


EP 


1252312 


EP 


1252326 


EP 


1252337 


EP 


1254147 


EP 


1254148 


EP 


1254150 


EP 


1254151 


EP 


1254152 


EP 


1254153 


EP 


1254154 


EP 


1254157 


EP 


1254165 


EP 


1254170 


EP 


1254171 


EP 


1254172 


EP 


1254173 


EP 


1254217 


EP 


1254218 


EP 


1254219 ! 


EP 


1254228 


EP 


1254242 


EP 


1254248 


EP 


1254272 . 


EP 


1255766 


EP 


1255767 


EP 


1255768 


EP 


1255776 


EP 


1255777 


EP 


1255778 


EP 


1255817 


EP 


1255864 


EP 


1255869 


EP 


1259525 


EP 


1259526 


EP 


1259528 


EP 


1259531 


EP 


1259540 


EP 


1259642 


EP 


1261380 


EP 


1261618 


EP 


1261619 


EP 


1261633 


EP 


1261634 



Form PCT/ISA/210 (citation family annex) (July 1998) 



INTERNATIONAL SEARCH REPORT 
Information on patent family members 



International application No. 
PCT/AU03/01166 



EP 


1261635 


EP 


1261637 


JBP 


1 0£1 *7fY* 

lzol /Ud 


EP 


i ozr i *T>f o 

1261742 


T3T> 

tlr 


1261745 


tjtj 




EP 


1265910 




12ooj27 


Uo 


OAAOA/IOAQ/C 


US 


2002042386 


TTC 


2002061521 


TTC 
Ub 


o a ao (\T7*\H A 
ZUOZO//Z/U 


US 


2002086330 


TTC 

Us 


2002086353 


TTC 
US 


OAAOAO/CQ1 1 

20020oooll 


T TO 

US 


2002086820 


Uc> 


or\no/\o^oo i 

200208 ooZl 


TTC 


TAAOAQ/CQOO 
ZOOZOoOoZZ 


US 


2002086823 


U£> 


OAA1AAA/1 C 

2002090615 


TTC 

US 


OA AT AAA*C70 

20020906 /Z 


us 


2002090673 


Tin 

US 


2002090674 


TTC 
US 


OAA^AA/IACO. 

20020949!) i 


us 


2002102638 


r to 

us 


oaa^ i i aa i a 

2002119919 


TTC 

us 


O AAO 1 OA 1 A1 

2002120103 . 


us 


2002132753 


us 


2002132767 


TTC1 

US 


2002147140 


us 


2002151009 


us 


2002151479 


us 


AAAA ■« st\ >< no 

2002160493 


us 


2002161208 


us 


2002164685 


us 


2002165137 


us 


2002168711 


us 


2002173454 


us 


AAfn A11/4A 

2003013649 


us 


2003039993 


us 


2003039994 


us 


AAA1A>I A OAA 

2003044890 


us 


2003044904 


us 


2003044905 


us 


2003044907 


us 


2003049650 


us 


2003049652 


us 


2003049703 


us 


2003050231 


us 


2003054368 


us 


2003054373 


us 


2003054375 


us 


2003054377 


us 


2003054379 


us 


2003054420 


us 


2003059875 


us 


lAA^fVf AAAD 

2003059908 


us 


2003077602 


us 


nAAIAII /•«/" 

2003077606 


us 


2003077703 


us 


2003077704 


us 


2003077808 


us 


2003082681 


us 


2003082758 


us 


r\r\r\"t A AO 1 AA 

2003092102 


TTC1 

us 


OAA">AAn/f1 1 

200309261 1 


us 


2003092615 


us 


oaaoaa^o 4 y 

2003096346 


us 


2003108907 


us 


2003125246 


us 


2003139327 


us 


oaai 1 nc AO 

2003157508 


us 


2003175739 


us 


2003190707 


wo 


0154472 


wo 


0154473 


wo 


0154474 


wo 


A1 C A HAD 

0154708 


wo 


0154733 


wo 


0155162 


wo 


0155163 


wo 


0155164 


wo 


0155167 


wo 


0155168 


wo 


0155173 


wo 


f\-i t~ f AAA 

0155200 


wo 


A1 CCO At 

0155201 


wo 


0155202 


wo 


0155203 


wo 


0155204 


wo 


0155205 


wo 


0155206 


wo 


0155207 


wo 


0155208 


wo 


0155300 


wo 


0155301 


wo 


0155302 


wo 


0155303 


wo 


0155304 


wo 


0155305 


wo 


0155306 


wo 


0155307 


wo 


0155308 


wo 


0155309 


wo 


0155310 


wo 


0155311 


wo 


0155312 


wo 


0155313 


wo 


0155314 


wo 


0155315 


wo 


0155316 



Form PCT/ISA/210 (citation family annex) (July 1998) 





i 

** 


) 


INTERNATIONAL SEARCH REPORT 
Information on patent family members 








International application No. 
PCT/AU03/01166 







WO 


0155317 


WO 


0155318 


wo 


0155319 






wo 


0155320 


WO 


0155321 


WO 


0155322 






wo 


0155323 


WO 


0155324 


WO 


0155325 






wo 


0155326 


WO 


0155327 


WO 


0155328 






wo 


0155329 


WO 


0155343 


WO 


0155350 






wo 


0155355 


WO 


0155364 


WO 


0155367 






wo 


0155368 


WO 


0155387 


WO 


0155388 






wo 


0155430 


wo 


0155440 


WO 


0155441 






wo 


0155447 


wo 


0155448 


WO 


0155449 






wo 


0157182 


wo 


0159063 


WO 


0159064 






wo 


0162789 


wo 


0164703 






WO 


9953040 


DE 


19817557 


EP 


1073727 


EP 


1291422 






us 


2003105315 










wo 


0194629 


AU 


64559/01 


CA 


2411601 


US 


2002081301 






US 


2002102531 


US 


2002102532 


US 


2002110821 






US 


2002115057. 


US 


2002115085 


us 


2002150877 






us 


2002165180 


US 


2003165839 


* 




wo 


9848010 


AU 


70139/98 


EP 


0979277 


us 


2002192160 


wo 


0160860 


AU 


41541/01 










wo 


0155313 


AU 


29508/01 


AU 


30958/01 


AU 


36445/01 






AU 


36459/01 


AU 


36460/01 


AU. 


36461/01 






AU 


36462/01 


AU 


36463/01 


AU 


36464/01 






AU 


36465/01 


AU 


36466/01 ■ 


AU 


37943/01 






AU 


37944/01 


AU 


37947/01 


AU 


37949/01 






AU 


37950/01 


AU 


37951/01 


AU 


37952/01 






AU 


37953/01 


AU 


37954/01 


AU 


37955/01 






AU 


37957/01 


AU 


37958/01 


AU 


38585/01 






AU 


39726/01 


AU 


39727/01 


AU 


39728/01 






AU 


41402/01 


AU 


41403/01 


AU 


41404/01 






AU 


41405/01 


AU 


41406/01 


AU 


41407/01 






AU 


41408/01 


AU 


41409/01 


AU 


41410/01 






AU 


41411/01 


AU 


41412/01 


AU 


41413/01 






AU 


41414/01 


AU 


41415/01 


AU 


41416/01 






AU 


41417/01 


AU 


41418/01 


AU 


41419/01 






AU 


43134/01 


AU 


43135/01 


AU 


43136/01 






AU 


43137/01 


AU 


45262/01 


AU 


45354/01 






AU 


47190/01 


AU 


47191/01 


AU 


49052/01 



Form PCJ7ISA/21 0 (citation family annex) (July 1998) 



0 .') 



INTERNATIONAL SEARCH REPORT 



Information on patent family members 


International application No. 




PCT/AU03/01166 



ATT 
AU 




ATT 

AU 


A A AC A ff\ 1 

49054/01 


AU 


50767/01 


ATT 

AU 


D\) /Oo/O I 


ATT* 

AU 


£ rV7 £f\ /A 1 

50769/01 


AU 


50770/01 


ATT 


JVf /1/01 


ATT 

AU 


C ATH /A 1 

50772/01 


AU 


52878/01 


ATT 
AU 




ATT 

AU 


C 1 A 1 T / A 1 

52917/01 


AU 


53323/01 


ATT 
AU 




ATT 

AU 


£T *"7 A 1 1 /A 1 

57912/01 


AU 


60969/01 


ATT 
AU 


/ri qoo/ai 


A TT 

AU 


66787/01 


AU 


71714/01 


ATT 

AU 


/455&/01 


CA 


11 All AO 

2392398 


CA 


2392422 


CA 


i i ai 41 o 
2 J 92428 


CA 


1-1 A1 /tl O 

2392438 


CA 


2392450 


CA 


2392751 


CA 


2392757 


CA 


2393002 


CA 


2393616 


CA 


2393618 


CA 


2393652 


CA 


2393912 


CA 


2393941 


CA 


2393954 


CA 


2394022 


CA 


2394039 


CA 


2394841 


CA 


2395178 


CA 


2395295 


CA 


2395398 


CA 


2395403 


CA 


2395654 


CA 


2395666 


CA 


2395671 


CA 


2395676 


CA 


2395693 


CA 


2395699 


CA 


2395724 


CA 


2395729 


CA 


2395734 


CA 


2395738 


CA 


2395787 


CA 


2395794 


CA 


2395811 


CA 


2395815 


S> a 

CA 


2395816 


CA 


2395827 


CA 


2395838 


CA 


2395849 


. CA 


2395857 


CA 


2395858 


CA 


11 AC OT1 

2395872 


CA 


2395885 


CA 


2395889 


CA 


2396719 


CA 


2397407 


CA 


2397502 


CA 


11 0*701 A 

2397839 


CA 


2398227 


CA 


2398275 


CA 


239841 1 


CA 


2398877 


CA 


2400638 


CA 


1 /A1 f AO 

2403508 


CA 


2406649 


EP 


1251863 


"CD 


1252176 


EP 


1252185 


EP 


1252289 


Jbr 


1 1C11AA 

1252290 


EP 


1252297 


EP 


1252302 


EP 


11C11A1 

1252303 


EP 


1252312 


EP 


1252326 


EP 


1252337 


EP 


1254147 


EP 


1254148 


EP 


1254150 


EP 


1254151 


EP 


1254152 


EP 


1254153 


EP 


1254154 


EP 


1254157 


EP 


1254165 


EP 


1254170 


EP 


1254171 


EP 


1254172 


EP 


1254173 


EP 


1254217 


EP 


1254218 


EP 


1254219 


EP 


1254228 


EP 


1254242 


EP 


1254248 


EP 


1254272 


EP 


1255766 


EP 


1255767 


EP 


1255768 


EP 


1255776 


EP 


1255777 


EP 


1255778 



Form PCT/ISA/2 10 (citation femily annex) (July 1998) 



INTERNATIONAL SEARCH REPORT 



Information on patent family members 


International application No. 




PCT/AU03/01166 



EP 


1255817 


EP 


1255864 


EP 


1255869 


EP 


1259525 


EP 


1259526 


EP 


1259528 


EP 


1259531 


EP 


1259540 


EP 


1259642 


EP 


1261380 


EP 


1261618 


EP 


1261619 


EP 


1261633 


EP 


1261634 


EP 


1261635 


EP 


1261637 


BP 


1261703 


EP 


1261742 


EP 


1261745 


EP 


1263944 


EP 


1265910 


EP 


1268527 


EP 


1268541 


EP 


1287015 


EP 


1301524 


us 


2002042096 


us 


2002042386 


US 


2002061521 


US 


2002077270 


us 


2002086330 


US 


2002086353 


US 


2002086811 


us 


2002086820 


us 


2002086821 


US 


2002086822 


us 


2002086823 


us 


2002090615 


US 


2002090672 


us 


2002090673 


us 


2002090674 


US 


2002094953 


us 


2002102638 


'.us 


2002106780 


US 


2002119919 


us 


2002120103 


us 


2002132753 


US 


2002132767 


us 


2002147140 


us 


2002151009 


US 


2002151479 


us 


2002160493 


us 


2002161208 


US 


2002164685 


us 


2002165137 


us 


2002168711 


US 


2002173454 


us 


2003013649 


us 


2003039993 


us 


2003039994 


us 


2003044890 


us 


2003044904 


us 


2003044905 


us 


2003044907 


us 


2003049650 


us 


2003049652 


us 


2003049703 


us 


2003050231 


us 


2003054368 


us- 


2003054373 


us 


2003054375 


us 


2003054377 


us 


2003054379 


us • 


2003054420 


us 


2003059875 


us 


. 2003059908 


us 


2003077602 


us 


2003077606 


us 


2003077703 


us 


2003077704 


us 


2003077808 


us 


2003082681 


us . 


2003082758 


us 


2003092102 


us 


2003092611 


us 


2003092615 


us 


2003096346 


us 


2003108907 


us 


2003125246 


1 us 


2003139327 


us 


2003157508 


us 


2003175739 


us 


2003190707 


wo 


0154472 


wo 


0154473 


wo 


0154474 


wo 


0154708 


wo 


0154733 


wo 


0155162 


wo 


0155163 


wo 


0155164 


wo 


0155167 


wo 


0155168 


wo 


0155173 


wo 


0155200 


wo 


0155201 


wo 


0155202 


wo 


0155203 


wo 


0155204 


wo 


0155205 


wo 


0155206 


wo 


0155207 



Form PCT/1SA/2I0 (citation family annex) (July 1998) 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 



Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 



HI BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 

□ FADED TEXT OR DRAWING 

□ BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 



UJf REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 
□ OTHER: 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



BEST AVAILABLE IMAGES 





INES OR MARKS ON ORIGINAL DOCUMENT 



r 



